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Mail Stop Appeal Brief - Patents 

COMMISSIONER FOR PATENTS 
P.O. Box 1450 
Alexandria, VA 22313-1450 

Dear Sir: 

The Applicants appeal the rejection of Claims 1-5 in the above-captioned patent 
application. These claims were rejected in a final Office Action dated July 5, 2005. In response 
to Applicants' Amendment and Response to Final Office Action, the Examiner issued an 
Advisory Action dated September 21, 2005. Applicants mailed a Notice of Appeal October 4, 
2005. 

I. REAL PARTY IN INTEREST 

Pursuant to 37 C.F.R. 41.37(c)(1), Appellants hereby notify the Board of Patent Appeals 
and Interferences that the real party in interest is the assignee of record for this application, 
Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080. 
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II. RELATED APPEALS AND INTERFERENCES 

A Notice of Appeal has been filed in the related Application Nos. 10/063,570; 
10/063,591, 10/063,617; 10/063,640; and 10/063,713. A Notice of Appeal and an Appeal Brief 
have also been filed in the related Application Nos. 10/063,519; 10/063,530; 10/063,534; 
10/063,540; 10/063,578; 10/063,584; 10/063,586, 10/063,587; 10/063,592; 10/063,616; 
10/063,648; 10/063,652; 10/063,653; 10/063,659; 10/063,660; and 10/063,661. Appellants are 
unaware of any other related appeals or interferences. 

III. STATUS OF THE CLAIMS 

The above-captioned application was filed with Claims 1-6. Claim 6 was cancelled and 
Claim 1 was amended in an Amendment and Response to Office Action mailed April 29, 2005. 
Claims 1-5 were finally rejected by the Examiner in a final Office Action mailed July 5, 2005. 
Appellants mailed an Amendment and Response to Final Office Action amending Claim 1 
September 2, 2005. The Examiner issued an Advisory Action dated September 21, 2005 which 
indicated that the amendments in the Amendment and Response to Final Office Action were 
entered. Accordingly, Claims 1-5 are the subject of this appeal. The claims at issue are attached 
hereto as Appendix A. 

IV. STATUS OF AMENDMENTS 

Appellants mailed an Amendment and Response to Final Office Action on September, 2 
2005. The Examiner issued an Advisory Action dated September 21, 2005 which indicated that 
the amendments in the Amendment and Response to Final Office Action were entered. 

V. SUMMARY OF THE CLAIMED SUBJECT MATTER 

The claimed subject matter relates to isolated antibodies which specifically bind to the 
polypeptide of SEQ ID NO: 14. As amended in the Amendment and Response to Final Office 
Action, independent Claim 1 reads: 

1 . An isolated antibody that specifically binds to the polypeptide of SEQ ID NO: 14. 
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VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL AND GROUPING 
OF CLAIMS 

A. Grounds of Rejection on Appeal 

In the final Office Action, Claims 1-5 were rejected on the assertion that the claimed 
invention is not supported by either a specific and substantial asserted utility or a well- 
established utility. 

In the final Office Action, the Examiner has also rejected Claims 1-5 under 35 U.S.C. 
§112, first paragraph. The Examiner asserts that since the claimed invention is not supported by 
either a specific or substantial asserted utility or a well-established utility, one skilled in the art 
clearly would not know how to use the claimed invention. Claims 1-5 were also rejected under 
35 U.S.C. §112, first paragraph because the art is unpredictable, because a multitude of 
homeostatic factors affect transcription and translation such that the utility of a polypeptide as a 
diagnostic is not contingent on the levels of mRNA expression and because "absent evidence of 
the protein's expression including the correlation to a diseased state, one of skill in the art would 
be unable to predictably use the polypeptides in any diagnostic setting without undue 
experimentation." Final Office Action at 8. 

B. Grouping of Claims 

1. Utility Rejection - Claims 7-5 

For the purposes of the utility rejection under 35 U.S.C. §101, Claims 1-5 can be 
considered as a group. 

2. Enablement Rejection - Claims 1-5 

For the purposes of the enablement rejection under 35 U.S.C. §112, first paragraph. 
Claims 1-5 can be considered as a group. 
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VII. APPELLANTS^ ARGUMENT 
A. Summary of Arguments 

1. Utility Rejection 

One issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility" for the claimed invention. See Examination Guidelines ("Utility 
Guidelines"), 66 Fed. Reg. 1092 (2001). Appellants have asserted that the claimed antibodies are 
useful as diagnostic tools for cancer, particularly for melanoma. This asserted utility is specific, 
substantial, and credible. 

Briefly stated, Appellants' asserted utility is based on the disclosure in Example 18 of the 
instant application that the mRNA encoding the PRO 1864 polypeptide is more highly expressed 
in melanoma compared to normal skin tissue. It is well-established that there is a reasonable 
correlation between changes in mRNA level for a particular gene and a corresponding change in 
the level of expression of the encoded polypeptide, such that increasing or decreasing the amount 
of mRNA for a particular gene leads to a corresponding increase or decrease in the amount of the 
encoded protein. Thus, one of skill in the art would be more likely than not to believe that, like 
the PRO 1864 mRNA, the PRO 1864 protein is more highly expressed in melanoma compared to 
normal skin tissue. Because of this differential expression, antibodies which bind to the 
PR01864 polypeptide are useful for distinguishing melanoma from normal skin tissue. 
Therefore, the claimed antibodies have a specific, substantial and credible utility as diagnostic 
tools for cancer, particularly melanoma, as is explained in more detail below. 

2. Enablement Rejection 

An additional issue before the Board is whether Appellants have enabled the pending 
claims such that one of skill in the art would be able to make and use the claimed invention. In 
the final Office Action, the Examiner has rejected Claims 1-5 under 35 U.S.C, §112, first 
paragraph, arguing that because the claimed invention is not supported by either a specific or 
substantial asserted utility or a well-established utility, one skilled in the art would not know how 
to use the claimed invention. Final Office Action at 8. 
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For the reasons discussed in detail below, the claimed invention is supported by a 
specific, substantial and credible utility. Accordingly, Appellants maintain that the specification 
enables one skilled in the art to make and use the claimed antibodies. 

In addition, the Examiner maintains that the specification is not enabling because the art 
is unpredictable, because a multitude of homeostatic factors affect transcription and translation 
such that the utility of a polypeptide as a diagnostic is not contingent on the levels of mRNA 
expression and because "absent evidence of the protein's expression including the correlation to 
a diseased state, one of skill in the art would be unable to predictably use the polypeptides in any 
diagnostic setting without undue experimentation." Final Office Action at 8. 

Briefly stated, Appellants maintain that the references cited by the Examiner do not 
contravene Applicants position that, in general, differential mRNA expression leads to 
differential polypeptide expression. In addition, the specification discloses how to make and use 
the claimed antibodies. In addition, methods for making and using antibodies are well known in 
the art. See In re Wands, 858 F.2d 731 (reversing the Board's decision of non-enablement and 
holding that as of 1980, undue experimentation was not required to make high-affinity 
monoclonal antibodies to a target peptide). Accordingly, one skilled in the art would be able to 
make and use the claimed antibodies without undue experimentation. 

Furthermore, Appellants maintain that, in view of the differential expression of the 
PRO 1864 polypeptide, one of skill in the art would be able to use the claimed antibodies as 
diagnostic tools regardless of whether or not the PRO 1864 polypeptide is the causative agent of 
melanoma. 

B. Utility Rejection - Detailed Arguments 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines, 66 Fed. Reg. 1092 (2001). 
Appellants have asserted that the claimed antibodies which bind to the polypeptide of SEQ ID 
NO: 14 (the PRO 1864 polypeptide) are useful as diagnostic tools for cancer, particularly for 
melanoma. This asserted utility is specific, substantial, and credible, as is explained in more 
detail below. 
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1. Utility - Lesal Standard 

A "specific utility" is defined as utility which is "specific to the subject matter claimed," 
in contrast to "a general utility that would be applicable to the broad class of the invention." See 
M.P.E.P. § 2107.01 I. For example, it is generally not enough to state that a nucleic acid is 
useful as a diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives fi-om the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "[t]he 
basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility." In 
explaining the "substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office 
personnel must be careful not to interpret the phrase "immediate benefit to the public" or similar 
formulations used in certain court decisions to mean that products or services based on the 
claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement. "Rather, any reasonable use that an applicant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient, at least with regard 
to defining a 'substantial' utility." M.P.E.P. § 2107.01 (emphasis added). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is useflil for any particular 
practical purpose . . . and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Finally, in assessing the credibility of the asserted utility, the M.P.E.P. states that "to 
overcome the presumption of truth that an assertion of utility by the applicant enjoys" the PTO 
must establish that it is "more likely than not that one of ordinary skill in the art would doubt 
(i.e., 'question') the truth of the statement of utility." M.P.E.P. § 2107.02 III A. 

2. Utility - Burden of Proof 

It is well established that a specification which contains a disclosure of utility which 
corresponds in scope to the subject matter sought to be patented "must be taken as sufficient to 
satisfy the utility requirement of § 101 for the entire claimed subject matter unless there is reason 
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for one skilled in the art to question the objective truth of the statement of utility or its scope." In 
re Langer, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Thus "the PTO has the 
initial burden of challenging a presumptively correct assertion of utility in the disclosure." In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). Only after the PTO provides 
evidence showing that one of ordinary skill in the art would reasonably doubt the asserted utility 
does the burden shift to the applicant to provide rebuttal evidence sufficient to convince such a 
person of the invention's asserted utility. Id 

3. Utility - Standard of Proof 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 U.S.P.Q. 592, 596 (Fed. Cir. 1983). The evidentiary standard to be used throughout ex 

parte examination in setting forth a rejection is a preponderance of the evidence, or "more likely 

than not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 

1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.E.P, § 2107.02, part VII (emphasis in original, 
citations omitted). 

The Court of Appeals for the Federal Circuit has stated that the standard for satisfying the 

utility requirement is a low one: 

The threshold of utility is not high : An invention is "useful" under section 101 if 
it is capable of providing some identifiable benefit. See Brenner v. Manson, 383 
U.S. 519, 534, 86 S.Ct. 1033, 16 L.Ed.2d 69 (1966); Brooktree Corp, v. Advanced 
Micro Devices, Inc., 977 F.2d 1555, 1571 (Fed. Cir. 1992) ("To violate § 101 the 
claimed device must be totally incapable of achieving a useful result"); Fuller v. 
Berger, 120 F. 274, 275 (7th Cir. 1903) (test for utility is whether invention "is 
incapable of serving any beneficial end"). Juicy Whip, Inc. v. Orange Bang, Inc, 
185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999) (emphasis added). 

The low threshold for satisfying the utility requirement is reflected in the standard set by the 
Federal Circuit for invalidating a patent based on a lack of utility: "[T]he fact that an invention 
has only limited utility and is only operable in certain applications is not grounds for finding lack 
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of utility. Some degree of utility is sufficient for patentability. Further, the defense of non-utility 

cannot be sustained without proof of total incapacity ." Envirotech Corp. v. Al George, Inc., 730 

F.2d 753, 762, 221 U.S.P.Q. 473 (Fed. Cir. 1984) (emphasis added, citations omitted). 

Because the standard for satisfying the utility requirement is so low, requiring total 

incapacity for a finding of no utility, the M.P.E.P. cautions that: 

Rejections under 35 U.S.C. 101 have been rarely sustained by federal courts. 
Generally speaking, in these rare cases, the 35 U.S.C. 101 rejection was sustained 
[] because the appHcant . . . asserted a utility that could only be true if it violated a 
scientific principle, such as the second law of thermodynamics, or a law of nature, 
or was wholly inconsistent with contemporary knowledge in the art. M.P.E.P. § 
2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 
(C.C.P.A. 1967) (underline emphasis in original, italic emphasis added). 

4. Appellants Asserted a Specific, Substantial and Credible Utility that is 
Sufficient to Satisfy the Utility Requirement ofS 101 

The claimed subject matter relates to isolated antibodies which specifically bind to the 
polypepfide of SEQ ID NO: 14. The polypeptide of SEQ ID NO: 14 (referred to as "PR01864 
polypeptide") is encoded by the polynucleotide of SEQ ID NO: 13 (also referred to as 
DNA45409-2511). Specification at t1 [0159-0160], Appellants have asserted that the claimed 
antibodies are useful as diagnostic tools for cancer, particularly melanoma. 

In "Example 18: Tumor Versus Normal Differential Tissue Expression Distribution" 
Appellants disclose that the mRNA encoding PRO 1864 polypeptide is more highly expressed in 
melanoma compared to normal skin tissue. Specification at ^ [0529-0530] and accompanying 
tables. As explained in paragraph [0530], the differential expression of the PRO 1864 mRNA 
was detected using the well-established technique of quantitative PGR amplification of cDNA 
libraries isolated from different human normal and tumor tissue samples. To ensure that 
equivalent amounts of nucleic acid were used in each reaction, the cDNA for P-actin was used as 
a control. 

The specification teaches that identification of the differential expression of a PRO 
polypeptide-encoding mRNA in one or more tumor tissues as compared to one or more normal 
tissues of the same tissue type "renders the molecule useful diagnostically for the determination 
of the presence or absence of tumor in a subject suspected of possessing a tumor." Specification 
at ^ [0530]. Because it is well established that changes in mRNA levels lead to changes in the 
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level of the encoded protein, one would expect the PRO 1864 protein to be differentially 

expressed in melanoma. The specification states that PRO polypeptides "may also be used 

diagnostically for tissue typing, wherein the PRO polypeptides of the present invention may be 

differentially expressed in one tissue as compared to another, preferably in a diseased tissue as 

compared to a normal tissue of the same tissue type." Specification at Tf [0336]. Likewise, the 

specification discloses the use of antibodies to PRO polypeptides as diagnostic tools: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO [polypeptide], 
e,g., detecting its expression (and in some cases, differential expression) in 
specific cells, tissues, or serum. Various diagnostic assay techniques known in the 
art may be used, such as competitive binding assays, direct or indirect sandwich 
assays and immunoprecipitation assays conducted in either heterogeneous or 
homogeneous phases. Specification at [0407]. 

Taken together, the specification clearly discloses the use of the claimed antibodies as 
diagnostic tools for cancer, particularly melanoma. This utility is substantial, as one of skill in 
the art will recognize that the diagnosis of cancer is a "real world" use; it is specific, as the 
diagnosis of melanoma is not a utility that applies to the broad class of antibodies; and it is 
credible, as it is not a utility "that could only be true if it violated a scientific principle, ...or a 
law of nature, or [is] wholly inconsistent with contemporary knowledge in the art." M.P.E.P. § 
2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 (C.C.P.A. 1967). 
Because Appellants' specification contains a disclosure of utility which corresponds in scope to 
the claimed subject matter, the asserted utility "must be taken as sufficient to satisfy the utility 
requirement of § 101 for the entire claimed subject matter unless there is reason for one skilled in 
the art to question the objective truth of the statement of utility or its scope." In re Langer, 503 
F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Therefore, the burden of establishing 
a prima facie case of lack of utility rests with the PTO. See, In re Brana, 51 F.3d 1560, 1566, 34 
U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("the PTO has the initial burden of challenging a 
presumptively correct assertion of utility in the disclosure"). 

5. The Examiner's Arsuments 

In the first Office Action, dated January 31, 2005, the Examiner rejected the pending 
claims, stating "Claims 1-6 are rejected under 35 U.S.C. 101 because the claimed invention is not 
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supported by either substantial asserted utility or a well established utility." First Office Action 
at 4. This rejection is maintained in the final Office Action. Office Action at 3. 

To establish a prima facie showing that the claimed subject matter lacks utility, the 
Examiner must "provide[] evidence showing that one of ordinary skill in the art would 
reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S,P.Q.2d 1436 
(Fed. Cir. 1995). The Examiner has issued a first Office Action, a final Office Action, and an 
Advisory Action during the prosecution of the instant application. None of these papers provide 
any evidence that one of ordinary skill in the art would reasonably doubt the asserted utility. 

The Examiner states that "The response argues that the identification of the differential 
expression of the PRO polypeptide-encoding nucleic acid in melanoma compared to normal skin 
tissue and that it is well established that the change in the level of mRNA encoding a particular 
protein generally leads to a change in the corresponding protein and given that the PRO 1864 
polypeptide is increased in melanoma compared to normal skin it is likely that the pro 
polypeptide is differentially expressed and can be used as a diagnostic tool Final Office 
Action at 3. However, the Examiner rejects this utility, stating that "the examiner cited numerous 
art that did show mRNA does not correlate with protein. " Final Office Action at 4. 

First, the Examiner challenges the sufficiency of the data presented in Example 18. In the 
first Office Action, the Examiner cited Hu et al as teaching that "for genes displaying a 5-fold 
change or less in tumors compared to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease." First Office Action at 6-7. In the final 
Office Action, the Examiner states "The response states that the general findings of Hu are not 
surprising and the strongest known relationship to disease is measured by the greater change in 
expression level and Hu's methodology yields results that prove little or no information 
regarding the biological significance of genes with less than 5 fold expression change (see page 
20-22 of response). In response to this, the argument does not present any evidence that under 5 
fold would be useful for diagnosis as asserted because the PRO 1864 is only overexpressed at 2 
fold." Final Office Action at 5. 

The Examiner cites G6kman-Polar et al. as teaching that there is no necessary correlation 
between increased mRNA levels and increased protein levels. Final Office Action at 4-5. In 
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addition, the Examiner maintains that Haynes et al and Gygi et al indicate that there is no 
correlation between mRNA levels and protein levels. Final Office Action at 4-5. 

The Examiner also asserts that the Alberts, Lewin and Zhigang references previously 
submitted by Appellants in support of their position that mRNA levels correlate with protein 
levels actually support the Examiner's position that there is no such correlation. Final Office 
Action at 4. 

In the Advisory Action mailed September 21, 2005, the Examiner also asserts that the 
asserted utilities are not substantial because "the data are set forth in the specification are 
preliminary at best because the specification does not teach the expression of the PRO 1864 
polypeptide nor any particular biological activity of the polypeptide to which the claimed 
antibody binds." Advisory Action page 2. The Examiner also maintains that there are distinct 
regulatory factors at the transcriptional and translational levels and that one of skill in the art 
would not assume that mRNA levels are predictive of polypeptide levels. 

6. The Examiner has not established a Prima Facie case that Claims 1-5 lack 
Utility 

The above arguments do not satisfy the Examiner's burden to "provide[] evidence 
showing that one of ordinary skill in the art would reasonably doubt the asserted utility." In re 
Brana, 51 F,3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The Examiner has the burden 
of presenting "countervailing facts and reasoning sufficient to establish that a person of ordinary 
skill would not believe the Appellant's assertion of utility." M.P.KP, at §2107.02 III.A., citing 
in re Brana, 51 F,3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("Only after the PTO 
provides evidence showing that one of ordinary skill in the art would reasonably doubt the 
asserted utility does the burden shift to the Appellant to provide rebuttal evidence") (emphasis 
added). With the exception of the specific references cited by the Examiner, the Examiner's 
assertions are not supported by any facts, evidence, or reasoning. Instead, the Examiner simply 
makes unsupported assertions. As for the references cited by the Examiner, for the reasons 
discussed below, they do not support the Examiner's position. Therefore there is simply no 
evidence in the record to support the Examiner's assertion that Appellants' asserted utility is not 
substantial, and the invention is incomplete. Absent some substantial evidence to support his 
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assertions, the Examiner has failed to establish a prima facie showing that one of skill in the art 
would reasonably doubt the asserted utility, and the Board should accept Appellants' disclosed 
utility as sufficient. 

a. The data in Example 18 are sufficient to establish the asserted utility 

Appellants turn first to the Examiner's arguments challenging the reliability of the data 
reported in Example 18. The Examiner argues that Example 18 and the first declaration of 
Mr. Grimaldi are insufficient to overcome the utility rejection. In the first Office Action, the 
Examiner cited Hu et al as teaching that "for genes displaying a 5-fold change or less in tumors 
compared to normal, there was no evidence of a correlation between altered gene expression and 
a known role in the disease." First Office Action at 6-7. In the final Office Action, the Examiner 
argues that Appellants arguments were insufficient to overcome this rejection. 

As an initial matter, Appellants note that, other than the Examiner's assertions based on 
Hu et al, the remainder of the objections are not supported by any evidence or reasoning as to 
why the data in Example 18 are insufficient, and therefore they cannot establish a prima facie 
case. See In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("Only after the 
PTO provides evidence showing that one of ordinary skill in the art would reasonably doubt the 
asserted utility does the burden shift to the Appellant to provide rebuttal evidence.") (emphasis 
added). Despite this deficiency, Appellants address the Examiner's objections below. 

The gene expression data in Example 18 of the specification show that the mRNA 
associated with protein PRO 1864 was more highly expressed in melanoma compared to normal 
skin tissue. See Specification at \ [0530] and accompanying tables. Gene expression was 
analyzed using standard quantitative PGR amplification reactions of cDNA libraries isolated 
fi"om different human tumor and normal human tissue samples. Id It is well known in the art 
that the number of copies of a particular cDNA in the cDNA library is determined by the number 
of copies of the corresponding mRNA in the sample. Therefore, the cDNA libraries can be used 
to determine the level of expression of the corresponding mRNA in the tissue. 

Appellants have asserted that identification of the differential expression of the PRO 1864 
polypeptide-encoding gene in tumor tissue compared to the corresponding normal tissue renders 
antibodies against the polypeptide useful as a diagnostic tool for the determination of the 
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presence or absence of tumor. Id, In support of this asserted utility, Appellants submitted on 
April 29, 2005 as Exhibit 1 to their Amendment and Response to Office Action a first 
Declaration of J. Christopher Grimaldi, an expert in the field of cancer biology. This declaration 
explains the importance of the data in Example 18, and how differential gene and protein 
expression studies are used to differentiate between normal and tumor tissue. See First Grimaldi 
Declaration, 

In paragraphs 6 and 7, Mr. Grimaldi explains that the semi-quantitative analysis 
employed to generate the data of Example 18 is sufficient to determine if a gene is over- or 
under-expressed in tumor cells compared to corresponding normal tissue. He states that any 
visually detectable difference seen between two samples is indicative of at least a two-fold 
difference in cDNA between the tumor tissue and the counterpart normal tissue. He also states 
that the results of the gene expression studies indicate that the genes of interest "can be used to 
differentiate tumor from normal." He explains that "[t]he precise levels of gene expression are 
irrelevant; what matters is that there is a relative difference in expression between normal tissue 
and tumor tissue." First Grimaldi Declaration at ^ 7. 

This declaration makes clear that since it is the relative level of expression between 
normal tissue and suspected cancerous tissue that is important, how high the level of expression 
in normal tissue is, is irrelevant. Mr. Grimaldi also states that if a difference is detected using 
these techniques, "this indicates that the gene and its corresponding polypeptide and antibodies 
against the polypeptide are useful for diagnostic purposes..." Id. Mr. Grimaldi's Declaration 
also states that the samples are pooled samples of normal and tumor tissue, and therefore are 
more reliable than individual samples. Id. at Tf 5. Thus, it is the uncontested opinion of an expert 
in the field that the results are reliable enough to indicate that the PRO 1864 gene, the PRO 1864 
polypeptide and the claimed antibodies are useful as diagnostic tools. 

In the first Office Action, the Examiner cited Hu et al as teaching that "for genes 
displaying a 5 -fold change or less in tumors compared to normal, there was no evidence of a 
correlation between altered gene expression and a known role in the disease." First Office Action 
at 6-7. In the final Office Action, the Examiner states "The response states that the general 
findings of Hu are not surprising and the strongest known relationship to disease is measured by 
the greater change in expression level and Hu's methodology yields results that prove little or no 
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information regarding the biological significance of genes with less than 5 fold expression 
change (see page 20-22 of response). In response to this, the argument does not present any 
evidence that under 5 fold would be useful for diagnosis as asserted because the PRO 1864 is 
only overexpressed at 2 fold." Final Office Action at 5. 

With respect to Hu et aL, Appellants maintain that these researchers used an automated 
literature-mining tool to summarize and estimate the relative strengths of all human gene-disease 
relationships published on Medline. They then generated a microarray expression dataset 
comparing breast cancer and normal breast tissue. Using their data-mining tool, they looked for 
a correlation between the strength of the literature association between the gene and breast 
cancer, and the magnitude of the difference in expression level. They report that for genes 
displaying a 5-fold change or less in tumors compared to normal, there was no evidence of a 
correlation between altered gene expression and a known role in the disease. See Hu at 411. 
However, among genes with a 10-fold or more change in expression level, there was a strong 
correlation between expression level and a published role in the disease. Id at 412. Importantly, 
Hu reports that the observed correlation was only found among estrogen receptor-positive 
tumors, not ER-negative tumors. Id. 

The general findings of Hu are not surprising - one would expect that genes with the 
greatest change in expression in a disease would be the first targets of research, and therefore 
have the strongest known relationship to the disease as measured by the number of publications 
reporting a connection with the disease. The correlation reported in Hu only indicates that the 
greater the change in expression level, the more likely it is that there is a published or known role 
for the gene in the disease, as found by their automated literature-mining software. Thus, Hu's 
results merely reflect a bias in the literature toward studying the most prominent targets, and 
reflect nothing regarding the ability of a gene that is 2-fold or more differentially expressed in 
tumors to serve as a disease marker. 

Hu acknowledges the shortcomings of this method in explaining the disparity in Hu's 
findings for ER-negative versus ER-positive tumors: Hu attributes the "bias in the literature" 
toward the more prevalent ER-positive tumors as the explanation for the lack of any correlation 
between number of publications and gene expression levels in less-prevalent (and, therefore, less 
studied) ER-negative tumors. Id. Because of this intrinsic bias, Hu's methodology is unlikely to 
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ever note a correlation of a disease with less differentially-expressed genes and their 
corresponding proteins, regardless of whether or not an actual relationship between the disease 
and less differentially-expressed genes exists. Accordingly, Hu's methodology yields results that 
provide little or no information regarding biological significance of genes with less than 5-fold 
expression change in disease. Nowhere in Hu does it say that a lack of correlation in their study 
means that genes with a less than five-fold change in level of expression in cancer cannot serve 
as a molecular marker of cancer. 

Appellants submit that a lack of known role for PRO 1864 in cancer does not prevent it or 
antibodies which bind to it from being useful as a diagnostic tool for cancer. There is a 
difference between use of a gene for distinguishing between tumor and normal tissue on the one 
hand, and establishing a role for the gene in cancer on the other. Genes with lower levels of 
change in expression may or may not be the most important genes in causing the disease, but the 
genes can still show a consistent and measurable change in expression. While such genes may or 
may not be good targets for further research, the genes, associated proteins and antibodies which 
bind to them can nonetheless be used as diagnostic tools. Thus, Hu does not refute the 
Appellants' assertion that antibodies which bind to the PRO 1864 polypeptide can be used as a 
cancer diagnostic tool because the polypeptide is differentially expressed in certain tumors. 

In fact, the PTO's own written policies recognize that the utility of a nucleic acid does not 
depend on the flinction of the encoded gene product. The Utility Examination Guidelines 
published on January 5, 2001 state: "In addition, the utility of a claimed DNA does not 
necessarily depend on the function of the encoded gene product. A claimed DNA may have a 
specific and substantial utility because, e.g. it hybridizes near a disease-associated gene or it has a 
gene regulating activity." (Federal Register, Volume 66, page 1095, Comment 14). Similarly, 
here the disclosed nucleic acids, as well as the encoded polypeptides and related antibodies, are 
useful for determining whether an individual has cancer regardless of whether or not they are the 
cause of the cancer. 

Appellants further note that the analgous standard for therapeutic utility of a compound 
does not require one to demonstrate that the compound can in fact treat a particular condition. 
The M.P.E.P. indicates that "the mere identification of a pharmacological activity of a compound 
that is relevant to an asserted pharmacological use provides an 'immediate benefit to the public' 
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and thus satisfies the utility requirement." M.P.E.P. §2701.01 (emphasis original). Here, the 
mere identification of altered expression in tumors is relevant to diagnosis of tumors, and, 
therefore, provides an immediate benefit to the public. 

The data in Example 18 and the first Grimaldi Declaration are therefore sufficient to 
establish the asserted utility, and the Examiner has not rebutted the presumption of utility that the 
Appellants' application is afforded. Mr. Grimaldi is an expert in the field who conducted or 
supervised the experiments at issue. His declaration is based on personal knowledge of the 
relevant facts at issue. Appellants' have reminded the Examiner that "Office personnel must 
accept an opinion from a qualified expert that is based upon relevant facts whose accuracy is not 
being questioned." M.P.E.P. § 2107 (emphasis added). In addition, declarations relating to 
issues of fact should not be summarily dismissed without an adequate explanation of how the 
declaration fails to rebut the Examiner's position. See in re Alton 76 F.3d 1 168 (Fed. Cir. 1996). 
The Examiner has offered no reason or evidence to reject either the underlying data or Mr. 
Grimaldi 's conclusions. Therefore, the Examiner should accept Mr. Grimaldi 's opinion with 
regard to his statement that "any visually detectable difference seen between two samples is 
indicative of at least a two-fold difference in cDNA between the tumor tissue and the counterpart 
normal tissue" and that the genes of interest "can be used to differentiate tumor from normal." 

With respect to the Examiner's statement that Appellants argument "does not present any 
evidence that under 5 fold would be useful for diagnosis as asserted because the PRO 1864 is 
only overexpressed at 2 fold" Appellants note that, as indicated in Mr. Grimaldi 's first 
declaration, the mRNAs identified as being differentially expressed in the experiments of 
Example 1 8 had at least a 2-fold difference in expression between tumor and normal tissue. 

In conclusion. Appellants submit that the evidence reported in Example 18, supported by 
the first Grimaldi Declaration, establish that there is at least a two-fold difference in PRO 1864 
mRNA between melanoma and normal skin tissue, respectively. Therefore, it follows that the 
PRO 1864 gene, polypeptide, and antibody can be used to distinguish melanoma from normal 
skin tissue. The Examiner has not offered any significant arguments or evidence to the contrary, 
and therefore has not established a prima facie case that one of skill in the art would reasonably 
doubt the asserted utility. 
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b. The references cited by the Examiner do not refute Appellants* assertion 
that a chanse in mRNA levels leads to a corresponding change in the 
level of the encoded protein 

Appellants turn next to Examiner's second argument that there is no correlation between 
mRNA levels and protein levels. For the reasons discussed below, none of these references are 
contrary to Appellants' assertion that generally speaking, changes in mRNA levels lead to 
corresponding changes in the level of polypeptide. 

The Examiner cites Gokman-Polar et al as teaching that there is no necessary correlation 
between increased mRNA levels and increased protein levels. Final Office Action at 4-5, In 
particular, the PTO relies on a statement from Gokman-Polar that "Quantitative reverse 
transcription-PCR analysis revealed that the PKC mRNA levels do not directly correlate with 
PKC protein levels, indicating that PKC isoenzyme expression is likely regulated at the 
posttranscriptional/translational level." Final Office Action at 5. 

A close review of Gokman-Polar indicates that with one exception, the trend in the data is 
that mRNA and protein levels are positively correlated, supporting Applicants assertion that 
increased mRNA levels correlate vdth increased protein levels. In Figure 2, the protein level of 
two isozymes shows a decrease, while the third is increased. This same pattern is seen for the 
corresponding mRNA levels in Figure 6, although admittedly the increase in mRNA for the third 
isozyme is minimal. Similarly, comparing the protein levels of the three isozymes in Figure 4 to 
the corresponding mRNA levels in Figure 7, with one exception the mRNA levels are positively 
correlated to protein levels. While protein levels do not increase or decrease in direct proportion 
to the changes in mRNA, the trend in five of the six examples is that protein levels are positively 
correlated to mRNA levels. This evidence is hardly sufficient to establish that one of skill in the 
art would reasonably doubt that there is a reasonable correlation between mRNA levels and 
protein levels. 

In addition, the Examiner maintains that Haynes et al. and Gygi et al indicate that there 
is no correlation between mRNA levels and protein levels. Final Office Action at 4-5. 

Appellants note that neither Gygi nor Haynes looked at whether a change in transcript 
level for a particular gene led to a change in the level of expression of the corresponding protein. 
Instead, these references considered whether the steady-state transcript level correlated with the 
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steady-state level of the corresponding protein. These references disclose that similar mRNA 
levels for different genes did not universally result in equivalent protein levels for the different 
gene products, and similar protein levels for different gene products did not universally result 
from equivalent mRNA levels for the different genes. These results are expected, since there are 
many factors that determine translation efficiency for a given transcript, or the half-life of the 
encoded protein. Not surprisingly, based on these results, Haynes concluded that protein levels 
cannot always be accurately predicted from the level of the corresponding mRNA transcript when 
looking at the level of transcripts across different genes. Furthermore, Appellants note that, 
consistent with Appellants' position, Gygi found that "there was a general trend of increased 
protein levels resulting from increased mRNA levels." Gygi p. 1726. 

Importantly, Haynes and Gygi did not say that for a single gene, the level of mRNA 
transcript is not positively correlated with the level of protein expression. Appellants have 
asserted that increasing or decreasing the level of mRNA for the same gene leads to a increase or 
decrease for the corresponding protein. Haynes and Gygi did not study this issue and say 
absolutely nothing about it. One cannot look at the level of mRNA across several different genes 
to investigate whether a change in the level of mRNA a particular gene leads to a change in the 
level of protein for that gene, Haynes and Gygi have nothing to do with changes in protein levels 
resultant from changes in mRNA levels because they did not examine whether a change in 
transcript level for a particular gene led to a change in the level of expression of the 
corresponding protein. Thus, when these references assert that protein levels cannot be 
accurately predicted from mRNA levels, this refers to the finding that cellular protein levels 
cannot always be calculated simply based on cellular mRNA levels. This is completely unrelated 
to the expectation of a change in protein levels as a ftmction of the change in encoding mRNA 
levels. Neither Haynes nor Gygi provide any insight whatsoever into the affects on protein levels 
caused by a change in the encoding mRNA levels. Appellants assert that increasing or 
decreasing the level of mRNA for the same gene leads to an increase or decrease for the 
corresponding protein. Since this issue is not addressed in Haynes and Gygi, these references 
offer no support for the PTO's position. 
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c. Conclusion - Examiner has failed to establish a prima facie case that 
one of skill in the art would doubt Appellants* asserted utility 

The Examiner challenges the sufficiency of the data presented in Example 18. In 
particular, in the first Office Action the Examiner cited Hu et al as teaching that "for genes 
displaying a 5 -fold change or less in tumors compared to normal, there was no evidence of a 
correlation between altered gene expression and a known role in the disease." First Office Action 
at 6-7. In the final Office Action, the Examiner states "The response states that the general 
findings of Hu are not surprising and the strongest known relationship to disease is measured by 
the greater change in expression level and Hu's methodology yields results that prove little or no 
information regarding the biological significance of genes with less than 5 fold expression 
change (see page 20-22 of response). In response to this, the argument does not present any 
evidence that under 5 fold would be useful for diagnosis as asserted because the PRO 1864 is 
only overexpressed at 2 fold." 

The Examiner cites Gokman-Polar et al. as teaching that there is no necessary correlation 
between increased mRNA levels and increased protein levels. Final Office Action at 4-5. In 
addition, the Examiner maintains that Haynes et al and Gygi et al indicate that there is no 
correlation between mRNA levels and protein levels. Final Office Action at 4-5. 

Appellants have responded to each of these arguments in turn. First, Appellants have 
shown that the data in Example 18 are sufficient to show that PRO 1864 is useful as a cancer 
diagnostic tool. This assertion is supported by the first Grimaldi declaration. The Examiner has 
not provided any substantial reason or evidence for one of skill in the art to doubt the reliability 
or usefulness of Example 1 8, or the facts and conclusions in the first Grimaldi declaration. In 
particular, Appellants maintain Hu et al does not contradict Appellants' position because Hu et 
al focuses on the role of polypeptides in cancer while Appellants asserted utility does not depend 
on any "role" that the PRO 1864 gene or polypeptide may play in skin cancer. Furthermore, with 
respect to the Examiner's assertion that " the argument does not present any evidence that under 
5 fold would be useful for diagnosis as asserted because the PRO 1864 is only overexpressed at 2 
fold," Appellants note that, as indicated in Mr. Grimaldi's first declaration, the mRNAs 
identified as being differentially expressed in the experiments of Example 18 had at least a 2-fold 
difference in expression between tumor and normal tissue. 
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With respect to Gokman-Polar Appellants have shown that a close review of this 
reference indicates that with one exception, the trend in the data is that mRNA and protein levels 
are positively correlated, supporting Applicants assertion that, more often than not, increased 
mRNA levels correlate with increased protein levels. 

With respect to Haynes et al and Gygi et al, Appellants have shown that these references 
are simply not relevant to the issue of whether a change in mRNA levels leads to a corresponding 
change in the level of the encoded protein. 

Taken together, the Examiner's arguments are not sufficient to satisfy the Examiner's 
burden to "provide[] evidence showing that one of ordinary skill in the art would reasonably 
doubt the asserted utility," In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 
1995). The Examiner's arguments are largely conclusory statements which are not supported by 
any substantial evidence or reasoning which explains why one of ordinary skill in the art would 
reasonably doubt the asserted utility. Therefore, the Board should accept the Appellants' 
disclosure of utility. See Ex parte Rubin, 5 U.S.P.Q. 2d 1461 (Bd. Pat. App, & Interf 1987) 
("There is no factual support in this record for the examiner's questioning of the denaturation test 
reported in the specification. ... No reason to doubt 'the objective truth' of the asserted utility 
having been advanced by the examiner, we accept appellant's disclosure of utility corresponding 
in scope to the claimed subject matter."). 

7. Appellants have provided Sufficient Rebuttal Evidence of Utility 

"Only after the PTO provides evidence showing that one of ordinary skill in the art would 

reasonably doubt the asserted utility does the burden shift to the applicant to provide rebuttal 

evidence." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The rebuttal 

evidence must be sufficient such that when it is considered as a whole, it is more likely than not 

that the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q,2d 1443, 

1444 (Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte 

examination in setting forth a rejection is a preponderance of the evidence, or "more likely than 

not" standard). The M.P.E.P. summarizes the standard of proof required: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
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statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P,KP, § 2107.02, part VII (emphasis in original, 
citations omitted). 

Appellants remind the Board that the Federal Circuit has stated that the standard for satisfying 
the utility requirement is a low one: "The threshold of utility is not high: An invention is 'useful' 
under section 101 if it is capable of providing some identifiable benefit." Juicy Whip, Inc. v. 
Orange Bang, Inc., 185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999). 

Even if the Examiner has satisfied his burden of presenting a prima facie case of lack of 
utility, Appellants have supplied more than enough rebuttal evidence, such that when considered 
as a whole, one of skill in the art would conclude that the asserted utility is more likely than not 
true. As discussed in detail below. Appellants have provided sufficient evidence that the gene 
encoding the PRO 1864 polypeptide is differentially expressed in certain cancers and can 
therefore be used as a diagnostic tool. In addition, Appellants have shown that it is well 
established in the art that there is a reasonable correlation between changes in mRNA level and 
changes in the corresponding protein level such that one of skill in the art would believe that the 
PRO 1864 polypeptide is also differentially expressed in certain cancers. Therefore, considering 
the evidence as a whole, one of skill in the art would believe that it is more likely than not that 
the claimed antibodies which bind to the PRO 1864 polypeptide are useful as diagnostic tools for 
cancer, particularly melanoma. 

a. Appellants have established that the sene encodins the PR01864 
polypeptide is differentially expressed in certain cancers 
As discussed above, the Examiner has not provided ^ly evidence or reasoning to 
challenge the reliability and significance of the data in Example 1 8 which reports that the mRNA 
for PRO 1864 is more highly expressed in melanoma compared to normal skin tissue. In contrast 
to this complete lack of evidence on the part of the Examiner, Appellants have submitted the first 
Grimaldi declaration. That declarafion establishes that it is the opinion of an expert in the field 
who has personal knowledge of the facts surrounding Example 1 8 that there is at least a two-fold 
difference in mRNA for PRO 1864 between the tumor tissue and the counterpart normal tissue, 
and that the PRO 1864 genes, polypeptides and antibodies are useful for differentiating tumor 
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tissue from normal tissue. The Examiner has not provided any evidence or reasoning to 
challenge the facts and conclusions of the first Grimaldi declaration in support of Example 18. 

Given the disclosure of Example 1 8 and the supporting first Grimaldi declaration on the 
one hand, and the complete lack of any evidence on the other, it is clear that considering the 
evidence as a whole, one of skill in the art would conclude that it is more likely than not that the 
PRO 1864 gene is differentially expressed in melanoma compared to normal skin tissue such that 
it is useful as a diagnostic tool to distinguish tumor tissue from normal tissue. 

As Appellants explain below, it is more likely than not that the PRO 1864 polypeptide is 
also differentially expressed in melanoma, and can therefore be used to distinguish tumor tissue 
from normal tissue. This provides utility for the claimed antibodies which bind to the PRO 1864 
polypeptide. 

b. Appellants have established that senerally there is a correlation between 
chanses in mRNA expression levels and chames in the expression level 
of the encoded protein 
Appellants next turn to the second portion of their argument in support of their asserted 
utility - that it is well-established in the art that in most cases a change in the level of mRNA for 
a particular protein leads to a corresponding change in the level of the encoded protein. Given 
Appellants' evidence of differential expression of the mRNA for the PRO 1864 polypeptide in 
melanoma, it is more likely than not that the PRO 1864 polypeptide is likewise differentially 
expressed, and therefore the claimed antibodies are useful as diagnostic tools, particularly for 
melanoma. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Appellants submitted a second Declaration by J. Christopher Grimaldi, an expert 
in the field of cancer biology (originally submitted on April 29, 2005 as Exhibit 3 with the 
Appellants' Amendment and Response to Office Action). As stated in paragraph 5 of the 
declaration, "Those who work in this field are well aware that in the vast majority of cases, when 
a gene is over-expressed... the gene product or polypeptide will also be over-expressed.... This 
same principal applies to gene under-expression." Amendment and Response to Final Office 
Action at 13, quoting Second Grimaldi Declaration at ^ 5. Further, "increased mRNA 
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expression is expected to result in increased polypeptide expression, and the detection of 

decreased mRNA expression is expected to result in decreased polypeptide expression." Id. 

Appellants also submitted the declaration of Paul Polakis, Ph.D. an expert in the field of 

cancer biology (attached as Exhibit 3 to Appellants' Amendment and Response to Office Action 

submitted on April 29, 2005). As stated in paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases studied in relation to the present invention] and my knowledge of the 
relevant scientific literature, it is my considered scientific opinion that for human 
genes, an increased level of mRNA in a tumor cell relative to a normal cell 
typically correlates to a similar increase in abundance of the encoded protein in 
the tumor cell relative to the normal cell. In fact, it remains a central dogma in 
molecular biology that increased mRNA levels are predictive of corresponding 
increased levels of the encoded protein . Polakis Declaration at ^ 6 (emphasis 
added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scientific research, that "such reports 
are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." Polakis Declaration at 1 6. 

The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Bruce Alberts, et al., Molecular Biology of 
the Cell (3^^ ed. 1994) (submitted on April 29, 2005 with Appellants' Amendment and Response 
to Final Office Action as Exhibit 4, hereinafter "Cell 3'"^") and (4'*" ed. 2002) (submitted on 
April 29, 2005 with Appellants' Amendment and Response to Office Action as Exhibit 5, 
hereinafter "Cell 4'*^")). Figure 9-2 of Cell 3^*^ shows the steps at which eukaryotic gene 
expression can be controlled. The first step depicted is transcriptional control. Cell 3*^^ provides 
that "[f]or most genes transcriptional controls are paramount. This makes sense because, of all 
the possible control points illustrated in Figure 9-2, only transcriptional control ensures that no 
superfluous intermediates are synthesized." Cell at 403 (emphasis added). In addition, the 
text states that "Although controls on the initiation of gene transcription are the predominant 
form of regulation for most genes , other controls can act later in the pathway fi-om RNA to 
protein to modulate the amount of gene product that is made." Cell at 453 (emphasis added). 
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Thus, as established in Cell 3*^ , the predominant mechanism for regulating the amount of protein 
produced is by regulating transcription. 

In Ceil 4^^, Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - most obviously by controlling the production of its mRNA'' Cell 4 at 
302 (emphasis added). Similarly, Figure 6-90 on page 364 of Cell 4'^ illustrates the path from 
gene to protein. The accompanying text states that while potentially each step can be regulated 
by the cell, " the initiation of transcription is the most common point for a cell to regulate the 
expression of each of its genes ." Cell at 364 (emphasis added). This point is repeated on 
page 379, where the authors state that of all the possible points for regulating protein expression, 
" [fjor most genes transcriptional controls are paramount ." Cell at 379 (emphasis added). 

Further support for Appellants' position can be found in the textbook, Genes VI, 
(Benjamin Lewin, Genes VI (1997)) (submitted on April 29, 2005 with Appellants' Amendment 
and Response to Office Action as Exhibit 6) which states "having acknowledged that control of 
gene expression can occur at multiple stages, and that production of RNA cannot inevitably be 
equated with production of protein, it is clear that the overwhelming majority of regulatory 
events occur at the initiation of transcription ." Genes VI at 847-848 (emphasis added). 

Additional support is also found in Zhigang et al. World Journal of Surgical Oncology 
2:13, 2004 (submitted on April 29, 2005 with Appellants' Amendment and Response to Office 
Action as Exhibit 7). Zhigang studied the expression of prostate stem cell antigen (PSCA) 
protein and mRNA to validate it as a potential molecular target for diagnosis and treatment of 
human prostate cancer. The data showed "a high degree of correlation between PSCA protein 
and mRNA expression" Zhigang at 4. Of the samples tested, 81 out of 87 showed a high degree 
of correlation between mRNA expression and protein expression. The authors conclude that "it 
is demonstrated that PSCA protein and mRNA overexpressed in human prostate cancer, and that 
the increased protein level of PSCA was resulted from the upregulated transcription of its 
mRNA." Id. at 6. Even though the correlation between mRNA expression and protein 
expression occurred in 93% of the samples tested, not 100%, the authors state that "PSCA may 
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be a promising molecular marker for the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor." Id, at 7. 

Further, Meric et al. Molecular Cancer Therapeutics, vol. 1, 971-979 (2002), (submitted 
on April 29, 2005 with Appellants' Amendment and Response to Office Action as Exhibit 8), 
states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the level 
of mRNA, which can be attributable to either DNA amplification or to differences 
in transcription. Meric etal at 971 (emphasis added). 

Exploiting differences in gene expression between cancer cells and normal cells would not be a 
"fiindamental principle" of molecular cancer therapeutics if there were no significant correlation 
between gene expression and protein levels. Stated another way, changes in mRNA without 
corresponding changes in protein levels would have little or no effect on cellular biology, and 
those of skill in the art would have no reason to examine the differences in gene expression at the 
mRNA level without such a correlation. However, as one of skill in the art recognizes, there is a 
strong correlation between changes in mRNA and changes in protein level. It is because of this 
strong correlation that it remains a "fundamental principle" of molecular therapeutics in cancer to 
look at changes in mRNA level. 

Together, the declarations of Grimaldi and Polakis, the accompanying references, and the 
excerpts and references discussed above all establish that the accepted understanding in the art is 
that there is a reasonable correlation between changes in gene expression and changes in the level 
of the encoded protein. In contrast to this substantial amoimt of evidence supporting Appellants' 
position, the Examiner has cited Gokman-Polar et al, Haynes et al and Gygi et aL As discussed 
above, Appellants have shown that a close review of Gokman-Polar indicates that with one 
exception, the trend in the data is that mRNA and protein levels are positively correlated, 
supporting Appellants' assertion that increased mRNA levels correlate with increased protein 
levels. In addition, as discussed above, Haynes and Gygi are not relevant to the issue of whether 
a change in mRNA levels for a single gene leads to a change in the level of the corresponding 
protein. It is clear that when considered as a whole, the preponderance of the evidence clearly 
weighs in favor of Appellants. 
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Appellants have presented sufficient evidence to establish that the mRNA for PRO 1864 
is differentially expressed in melanoma compared to their normal tissue counterparts, and that it 
is more likely than not that this leads to differential expression of the PRO 1864 polypeptide. 
This makes the claimed antibodies useful for diagnosing cancer, particularly melanoma. Given 
the overwhelming amount of evidence in support of Appellants' position, and the near absence of 
any evidence in support of the Examiner's position, when considered as a whole the evidence 
leads a person of ordinary skill in the art to conclude that the asserted utility is more likely than 
not true. 

c. The asserted utility is specific 

Finally, Appellants address the PTO's assertion that the asserted utilities are not specific 
to the claimed antibodies which bind to the PRO 1864 polypeptide. 

Specific utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01 I. Appellants submit that the evidence of differential expression of the 
PRO 1864 gene and polypeptide in certain types of tumor cells, along with the declarations and 
references discussed above, provide a specific utility for the claimed antibodies which bind to the 
PRO 1864 polypeptide. 

As discussed above, there are significant data which show that the gene for the PRO 1864 
polypeptide is expressed at least two-fold higher in melanoma compared to normal skin tissue. 
These data are strong evidence that the PRO 1864 gene and polypeptide are associated with 
melanoma. Thus, contrary to the assertions of the Examiner, Appellants have provided evidence 
associating the PRO 1864 gene and polypeptide with a specific disease. The asserted utility for 
the claimed antibodies as a diagnostic tool for cancer, particularly melanoma, is a specific utility 
- it is not a general utility that would apply to the broad class of antibodies. 

8. The Examiner's Response to Appellants' Evidence is Insufficient to Rebut 

Appellants ' Ar2uments 
The Examiner has stated that the Grimaldi and Polakis declarations are insufficient to 
overcome the rejection of claims based on 35 U.S.C. §§ 101. Final Office Action at 3-4. 
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a. The Examiner's response to the First Grimaldi Declaration 

The Examiner asserts that the first Grimaldi Declaration ''only states that Example 18 
showed mRNA expression and the declaration does not state that the protein was expressed it 
only states that the peptide and antibodies to the peptide can be useful for diagnostics." Final 
Office Action at 3-4. 

The Examiner's statement has nothing to do with the accuracy of the conclusions 
expressed in the first Grimaldi Declaration, and does not provide a basis for rejecting 
Mr. Grimaldi's conclusions. In addition, as discussed above, Appellants have provided 
numerous references and the declaration of an additional expert which indicates that it is well 
established that generally, there is a correlation between changes in mRNA level and changes in 
the level of the corresponding protein. 

Appellants submit that the declaration of Mr. Grimaldi is based on personal knowledge of 
the relevant facts at issue. Mr. Grimaldi is an expert in the field and conducted or supervised the 
experiments at issue. Appellants remind the PTO that "[ojffice personnel must accept an opinion 
from a qualified expert that is based upon relevant facts whose accuracy is not being questioned." 
PTO Utility Examination Guidelines (2001) (emphasis added). In addition, declarations relating 
to issues of fact should not be summarily dismissed without an adequate explanation of how the 
declaration fails to rebut the Examiner's position. In re Alton 76 F.3d 1168 (Fed. Cir. 1996), 
The PTO has not supplied any reasons or evidence to question the accuracy of the facts upon 
which Mr. Grimaldi based his opinion. Mr. Grimaldi has personal knowledge of the relevant 
facts, has based his opinion on those facts, and the PTO has offered no reason or evidence to 
reject either the underlying facts or his opinion. Therefore, the PTO should accept Mr. 
Grimaldi 's statement that if a difference in gene expression is detected "this indicates that the 
gene and its corresponding polypeptide and antibodies against the polypeptide are useful for 
diagnostic purposes to screen samples to differentiate between normal and tumor." First 
Grimaldi Declaration at ^ 7. 

The Examiner also maintains that the asserted utilities are not substantial because "the 
data are set forth in the specification are preliminary at best because the specification does not 
teach the expression of the PRO 1864 polypeptide nor any particular biological activity of the 
polypeptide to which the claimed antibody binds." Advisory Action page 2. As discussed above, 
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Appellants maintain that the data in Example 18 are reliable. Furthermore, with respect to the 
Examiner's assertion that the claimed antibodies lack utility because the biological activity of the 
PRO 1864 polypeptide is not disclosed, Appellants submit that whether or not the biological 
activity of the PRO 1864 polypeptide is disclosed does not impact the use of antibodies which 
bind to the polypeptide as a diagnostic tool for cancer. One does not need to know what the 
biological role of the PRO 1864 polypeptide is to exploit its differential expression to distinguish 
tumor from normal tissue using the claimed antibodies. In fact the Revised Interim Utility 
Guidelines promulgated by the PTO recognize that proteins which are differentially expressed in 
cancer have utility. {See the caveat in Example 12 which state that the utility requirement is 
satisfied where a protein is expressed in melanoma cells but not on normal skin and antibodies 
against the protein can be used to diagnose cancer.) In addition, while Appellants appreciate that 
actions taken in other applications are not binding on the PTO with respect to the present 
application, Appellants note that the PTO has issued several patents claiming antibodies to 
differentially expressed polypeptides or methods employing such antibodies. {See, e.g., U.S. 
Patent No. 6,156,500 and U.S. Patent No. 6,562,343, submitted as Exhibits 10-11 on April 29, 
2005 with Appellants' Amendment and Response to Office Action). Thus, Appellants submit 
that they have established the utility of the claimed antibodies as a cancer diagnostic tool. 

b. The Examiner's response to the Second Grimaldi Declaration 
The Examiner notes that the second Grimaldi Declaration states that it is unlikely that one 
identifies an increase or decrease in mRNA without a corresponding change in the level of the 
encoded protein and that, in the rare case where the protein does not correlate with mRNA, this 
would still provide crucial information for the clinician. Final Office Action at 4. In response, 
the Examiner asserts that he has cited numerous references showing that mRNA does not 
correlate with protein and that the instances where mRNA and protein do not show a correlation 
are not as rare as indicated in the declaration. 

As discussed above, Appellants maintain that the data in Example 1 8 are reliable and that 
it is more likely than not that the PRO 1864 polypeptide is differentially expressed. As discussed 
above, Appellants have provided numerous references and the declaration of an additional expert 
which indicates that it is well established that generally, there is a correlation between changes in 
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mRNA level and changes in the level of the corresponding protein. Furthermore, as discussed 

above, the Gokman-Polar et al, Haynes et al and Gygi et al. references cited by the Examiner 
do not contradict Appellants' position that, in general, differential expression of mRNA 
correlates with differential expression of the encoded polypeptide. 

c. The Examiner's response to the Polakis Declaration 
The Declaration of Paul Polakis was asserted to be insufficient to overcome the utility 
rejection. The Examiner notes that "the Polakis declaration states that in 80% of the 
observations they have found that increases in the levels of a particular mRNA correlates with 
changes in the level of protein expressed from that mRNA in human tumor cells." Final Office 
Action at 4. In response to this argument, the Examiner asserts that he has cited art in the 112 
first rejection that shows that mRNA over-expression does not correlate with protein over- 
expression. Further, the Examiner asserts that "while the declaration may show a correlation 
between mRNA and protein over-expression in some cases, no evidence has been submitted that 
it is the norm rather than the exception that protein levels parallel gene expression in cancer 
cells." Final Office Action at 4. 

As discussed above, Appellants maintain that the data in Example 18 are reliable and that 
it is more likely than not that the PRO 1864 polypeptide is differentially expressed. As discussed 
above. Appellants have provided numerous references and the declaration of an additional expert 
which indicates that it is well established that generally, there is a correlation between changes in 
mRNA level and changes in the level of the corresponding protein. Furthermore, as discussed 
above, the Gokman-Polar et al , Haynes et al and Gygi et al. references cited by the Examiner 
do not contradict Appellants' position that, in general, differential expression of mRNA 
correlates with differential expression of the encoded polypeptide. 

rf. The Examiner's response to the Alberts and Lewin references submitted 
by Appellants 

Appellants submitted the Alberts and Lewin references in support of their position that, in 
general, differential mRNA expression correlates with differential expression of the encoded 
polypeptide. However, the Examiner asserts that "Alberts and Lewin actually support the 
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unpredictability in that they teach controls and at several levels these controls can have an effect 
on expression of the protein." Final Office Action at 4, 

The PTO does not address the teachings in Alberts that "the initiation of transcription is 
the most common point for a cell to regulate the expression of each of its genes" and ''[f)or most 
genes transcriptional controls are paramount ." Alberts at 379 (emphasis added). The PTO also 
does not address the teachings in Lewin that " the overwhelming majority of regulatory events 
occur at the initiation of transcription." Lewin at 848 (emphasis added). The PTO does not 
provide a basis for failing to address evidence of the "overwhelming majority" and "paramount" 
controls of gene expression, and instead focuses on other controls that can act later. Appellants 
do not deny that protein expression can be regulated at various levels. However, Appellants 
submit that a full consideration of the submitted evidence would lead to a conclusion that the 
"overwhelming majority" and "paramount" controls have a greater influence on gene expression 
than "other controls" that can act later. Accordingly, full consideration of this evidence would 
lead to a conclusion that mRNA levels are paramount in influencing polypeptide expression 
levels. 

e. The Examiner's response to the Zhi^ans reference submitted by 
Appellants 

Appellants submitted the Zhigang reference in support of their position that, in general, 
differential mRNA expression correlates with differential expression of the encoded polypeptide. 
The Examiner acknowledges that "Zhigang et al does show protein expression, however, the 
experiments were carried out to demonstrate this." Final Office Action at 4. The Examiner 
asserts that " Zhigang et al support that one needs to actually determine the expression of the 
protein in order to be sure of expression." Final Office Action at 4. 

Zhigang reported that the correlation between mRNA expression and protein expression 
occurred in 93% of the samples tested. Appellants submit that there is no requirement to provide 
evidence sufficient to establish an asserted utility as a matter of statistical certainty. "Instead, 
evidence will be sufficient if, considered as a whole, it leads a person of ordinary skill in the art 
to conclude that the asserted utility is more likely than not true ." M.P.E.P. at § 2107.02, part VII 
(2004) (emphasis in original, internal citations omitted). However, the PTO holds that a showing 
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of 93% correlation between mRNA and polypeptide levels is an indication of support for 
unpredictability of the relationship between mRNA and polypeptide levels. This is standard is 
inconsistent with the Utility Guidelines and governing case law. Since the PTO's rejection of the 
claimed antibodies as lacking utility is based on this incorrect standard, the basis for holding of a 
lack of utility cannot be maintained. 

9. The Courts have held that the Utility Requirement was Satisfied in Similar 
Cases 

The seminal decision interpreting the utility requirement of 35 U.S.C. § 101 is Brenner v. 
Manson, 383 U.S. 519, 148 U.S.P.Q. 689 (1966). At issue in Brenner was a claim to "a chemical 
process which yields an already known product whose utility - other than as a possible object of 
scientific inquiry - ha[d] not yet been evidenced." Id, at 529, 148 U.S.P.Q. at 693. The Patent 
Office rejected the claimed process for lack of utility because the product produced by the 
claimed process had no known use. See id. at 521-22, 148 U.S.P.Q. at 690. On appeal, the Court 
of Customs and Patent Appeals reversed, holding "where a claimed process produces a known 
product it is not necessary to show utility for the product." Id. at 522, 148 U.S.P.Q. at 691. 

In reviewing the lower court's decision, the Court made its oft quoted statement that 
"[t]he basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility. Unless 
and until a process is refined and developed to this point - where specific benefit exists in 
currently available form - there is insufficient justification for permitting an Appellant to engross 
what may prove to be a broad field." Id at 534-35, 148 U.S.P.Q. at 695. 

The first opinion of the C.C.P.A. applying Brenner was In re Kirk, 376 F.2d 936, 153 
U.S.P.Q. 48 (C.C.P.A. 1967). The invention claimed in Kirk was a set of steroid derivatives said 
to have valuable biological properties and to be of value "in the furtherance of steroidal research 
and in the application of steroidal materials to veterinary or medical practice." M at 938, 153 
U.S.P.Q. at 50. In affirming the claim rejection based on a lack of utility, the court held that the 
"nebulous expressions 'biological activity' or 'biological properties'" did not adequately convey 
how to use the claimed compounds." Id. at 941, 153 U.S.P.Q. at 52. The court also rejected 
Appellants' supporting affidavit, stating, "the sum and substance of the affidavit appears to be 
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that one of ordinary skill in the art would know 'how to use' the compounds to find out in the 
first instance whether the compounds are - or are not - in fact useful or possess useful properties, 
and to ascertain what those properties are." Id at 942, 153 U.S.P.Q. at 53. 

Since these early decisions, the courts have continued to clarify what is sufficient to 
satisfy the utility requirement. Three more recent decisions are of particular relevance to the 
instant application: Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. 881 (C.C.P.A. 1980), Cross v. 
lizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 (Fed. Cir. 1985), and Fujikawa v. Wattanasin, 93 F.3d 
1559, 39 U.S.P.Q. 2d 1895 (Fed. Cir. 1996). 

The earliest of these cases, Nelson v. Bowler, involved an interference between two 
applications related to derivatives of naturally occurring prostaglandins (PG). Nelson, 626 F.2d 
at 854-55. The issue was whether Nelson had shown at least one utility for the compounds at 
issue to establish an actual reduction to practice. Id. at 855. The Appellants relied on two tests 
to prove practical utility: an in vivo rat blood pressure (BP) test and an in vitro gerbil colon 
smooth muscle stimulation (GC-SMS) test. In the BP test, the blood pressure of anesthetized rats 
was recorded on a polygraph chart to determine whether an injected compound had any effect. 
Responses were categorized as either a depressor (lowering) effect or a pressor (elevating) effect. 
Id, In the GC-SMS test a section of colon was excised from a freshly-killed gerbil for suspension 
in a physiological solution, and a lever arm was connected to the colon in such a way that any 
contraction was recorded as a polygraph trace. Id. The Board held that Nelson had not shown 
adequate proof of practical utility, characterizing the tests as "rough screens, uncorrelated with 
actual utility." Id, at 856. 

On appeal the C.C.P.A. reversed, holding that the Board "erred in not recognizing that 

tests evidencing pharmacological activity may manifest a practical utility even though they may 

not establish a specific therapeutic use." M The Court stated that "practical utility" was 

characterized as a use of the claimed discovery in a manner which provides some immediate 

benefit to the public, establishing the following rule: 

Knowledge of the pharmacological activity of any compound is obviously 
beneficial to the public. It is inherently faster and easier to combat illnesses and 
alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities. Since it is crucial to provide 
researchers with an incentive to disclose pharmacological activities in as many 
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compounds as possible, we conclude that adequate proof of any such activity 
constitutes a showing of practical utility. Id (emphasis added). 

The Court rejected Bowler's argument that the BP and GC-SMS tests are inconclusive 
showings of pharmacological activity since confirmation by statistically significant means did not 
occur until after the critical date. The Court stated that "a rigorous correlation is not necessary 
where the test for pharmacological activity is reasonably indicative of the desired response." Id. 
(emphasis added). The Court concluded that a " reasonable correlation " between the observed 
properties and the suggested use was sufficient to establish practical utility. Id. at 857. 

The sufficiency of a "reasonable correlation" in establishing utility was affirmed by the 
Court of Appeals for the Federal Circuit in Cross v. lizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 
(Fed. Cir. 1985). In Cross, the subject of the interference before the Court was imidazole 
derivative compounds which inhibit the synthesis of thromboxane synthetase, an enzyme which 
leads to the formation of thromboxane A2. At the time the applications were filed, 
thromboxane A2 was postulated to be involved in platelet aggregation, which was associated with 
several deleterious conditions. Id, at 1042. 

The question before the Board and reviewed by the Court was whether lizuka was entitled 
to the benefit of his Japanese priority application. Id, The Japanese application disclosed that 
the imidazole derivatives showed strong inhibitory action for thromboxane synthetase from 
human or bovine platelet microsomes, an in vitro utility. Id. at 1043. Relying in part on Nelson, 
the Board held that tests evidencing pharmacological activity may manifest a practical utility 
even though they may not establish a specific therapeutic use, and concluded that the in vitro 
tests were sufficient to establish a practical utility. Id. 

On appeal, Cross argued that the basic in vitro tests conducted in cellular fi-actions did not 
establish a practical utility for the claimed compounds, and that more sophisticated in vitro or in 
vivo tests were necessary to establish a practical utility. Id. at 1050. The Court rejected this 
argument, noting that adequate proof of any pharmaceutical activity constitutes a showing of 
practical utility. Id, The Court accepted the argument that initial testing of compounds is widely 
done in vitro: 

[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. lizuka has not urged, and 
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rightly so, that there is an invariable exact correlation between in vitro test results 
and in vivo test results. Rather, lizuka's position is that successful in vitro testing 
for a particular pharmacological activity establishes a significant probability that 
in vivo testing for this particular pharmacological activity will be successful. Id 
(emphasis added). 

The Court also noted that in previous decisions, its predecessor court had accepted 

evidence of in vivo utility as sufficient to establish practical utility. The Court reasoned that: 

This in vivo testing is but an intermediate link in a screening chain which may 
eventually lead to the use of the drug as a therapeutic agent in humans. We 
perceive no insurmountable difficulty, under appropriate circumstances, in finding 
that the first link in the screening chain, in vitro testing, may establish a practical 
utility for the compound in question. Successful in vitro testing will marshal 
resources and direct the expenditure of effort to further in vivo testing of the most 
potent compounds, thereby providing an immediate benefit to the public, 
analogous to the benefit provided by the showing of an in vivo utility . Id at 1051, 
citing Nelson, 626 F.2d at 856 (emphasis added). 

Based on this reasoning, the Court affirmed the decision of the Board, stating that "based 
upon the relevant evidence as a whole, there is a reasonable correlation between the disclosed in 
vitro utility and an in vivo activity, and therefore a rigorous correlation is not necessary where the 
disclosure of pharmacological activity is reasonable based upon the probative evidence." Id, at 
1050 (emphasis added). The Court therefore held that the disclosed in vitro utility was 
"sufficient to comply with the practical utility requirement of § 101." Id. at 1051. 

The holdings of Nelson and Cross were more recently affirmed in Fujikawa v. 

Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996). In Fujikawa, the Court again 

affirmed the notion that initial screens of compounds provide a practical utility even though they 

may not provide a therapeutic use because "'[i]t is inherently faster and easier to combat illnesses 

and alleviate symptoms when the medical profession is armed with an arsenal of chemicals 

having known pharmacological activities.'" Id. at 1564, quoting Nelson, 626 F.2d at 856. The 

Court noted that it may be difficult to predict whether novel compounds will exhibit 

pharmacological activity, and consequently testing is often required to establish practical utility. 

Id, However the Court went on to state: 

But the test results need not absolutely prove that the compound is 
pharmacologically active. All that is required is that the tests be ''reasonably 
indicative of the desired [pharmacological] response." In other words, there must 
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be a sufficient correlation between the tests and an asserted pharmacological 
activity so as to convince those skilled in the art, to a reasonable probability , that 
the novel compound will exhibit the asserted pharmacological behavior." Id. 
(intemal citations omitted, underline emphasis added, italics in original). 

On appeal, Fujikawa argued that Wattanasin had failed to establish an adequate 
correlation between the in vitro and in vivo results to permit Wattanasin to rely on positive in 
vitro results to establish a practical utility. The Court stated that the Board relied on testimony 
from those skilled in the art that the in vitro results convinced the experts that the claimed 
compounds would exhibit the desired pharmacological activity when administered in vivo, 
including testimony that in vivo activity is typically highly correlatable to a compound's in vitro 
activity in the field. Id. at 1565. To overcome this evidence and counter the Board's decision, 
Fujikawa pointed to the testimony of its expert that "there is a reasonable element of doubt that 
some elements may be encountered which are active in the in vitro assay, but yet inactive in the 
in vivo assay." Id. 

The Court rejected this argument: "Of course, it is possible that some compounds active 
in vitro may not be active in vivo. But, as our predecessor court in Nelson explained, a 'rigorous 
correlation' need not be shown in order to establish practical utility; 'reasonable correlation' 
suffices ." Id. (emphasis added). The Court also rejected Fujikawa's reliance on two articles. 
The Court noted that while one article taught that ''in vitro testing is sometimes not a good 
indicator of how potent a compound will be in vivo, it does imply that compounds which are 
active in vitro will normally exhibit some in vivo activity." Id. at 1566. Similarly, the Court 
noted that the second article expressly stated that "[f|or most substances, although not for all, the 
relative potency determined in in vitro ... parallels the in vivo activity." Id. 

The Court concluded that the facts in the case were analogous to the ones in Cross where 
the court relied on a known reasonable correlation between in vitro tests and in vivo activity, and 
therefore affirmed the Board's decision that Wattanasin had established a practical utility with 
the in vitro results. Id. at 1 565-66. 

The Nelson, Cross, and Fujikawa cases are very similar to the present case. The 
reasoning of the courts in all three cases that "'[i]t is inherently faster and easier to combat 
illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities'" applies to the asserted utility for the 
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claimed antibodies. Fujikawa, 93 F.3d at 1564, quoting Nelson, 626 F.2d at 856; see also Cross, 
753 F.2d at 1051 ("Successful in vitro testing will marshal resources and direct the expenditure 
of effort to further in vivo testing of the most potent compounds, thereby providing an immediate 
benefit to the public, analogous to the benefit provided by the showing of an in vivo utility.")- 
Like pharmaceutical compounds, nucleic acids, polypeptides, and antibodies which are 
associated with cancer will make it inherently faster and easier to combat cancer. The greater the 
number of biological markers of cancer medical professionals have access to, the more accurate 
and detailed a diagnosis they can make. The determination that a gene is differentially expressed 
in cancer constitutes at least as significant a development in the field of cancer diagnostics as in 
vitro screening for pharmaceutical activity. See Cross, 753 F.2d at 1051 ("the first link in the 
screening chain, in vitro testing, may establish a practical utility for the compound in question. 
Successful in vitro testing will marshal resources and direct the expenditure of effort to further in 
vivo testing of the most potent compounds, thereby providing an immediate benefit to the 
public"). 

In addition, like in vitro tests in the pharmaceutical industry, those of skill in the field of 
biotechnology rely on the reasonable correlation that exists between gene expression and protein 
expression (see discussion supra). Were there no reasonable correlation between the two, the 
techniques that measure gene levels such as microarray analysis, differential display, and 
quantitative PGR would not be so widely used by those in the art. See Second Grimaldi 
Declaration at TI 5. As in Cross, Appellants here do not argue that there is "an invariable exact 
correlation" between gene expression and protein expression. See Cross, 753 F.2d at 1050. 
Instead, Appellants' position detailed above is that a measured change in gene expression in 
cancer cells establishes a "significant probability" that the expression of the encoded polypeptide 
in cancer will also be changed based on "a reasonable correlation therebetween." Id/, see also 
Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order to establish 
practical utility; 'reasonable correlation' suffices"); Nelson, 626 F.2d at 857 (holding that "a 
rigorous correlation is not necessary" and that a "reasonable correlation" will suffice). 

Also of importance is the Court's rejection of the notion that any in vitro testing must be 
statistically significant to support a practical utility. Nelson, 626 F.2d at 857. Likewise, 
qualitative characterizations of a test compound as either increasing or decreasing blood pressure 
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was acceptable. Id at 855 (stating that responses were categorized as either a depressor 
(lowering) effect or a pressor (elevating) effect). This is similar to the data in Example 18, where 
the change in mRNA levels is described as "more highly expressed." 

There are additional similarities. In Fujikawa, the Board and Court rejected the argument 
that there was no utility because there was no exact correlation between the in vitro and in vivo 
results in spite of supporting testimony and references. Fujikawa, 93 F.3d at 1565-66. Like the 
two references rejected by the Board and Court in Fujikawa, the Examiner asserts that the 
correlation between changes in mRNA levels and protein levels is not exact. But like Fujikawa, 
Appellants have submitted the declaration of two experts in the field which state that those in the 
field rely on the correlation between changes in mRNA and protein. See Second Grimaldi 
Declaration at \ 5; Polakis Declaration at ^ 6. Thus, as was the case in Fujikawa, although there 
may be some evidence that the correlation relied on is not exact, the declarations and numerous 
references submitted by Appellants is more than enough evidence to establish that there is a 
"reasonable correlation" between changes in mRNA levels and changes in the level of the 
encoded protein. 

In conclusion, Appellants have asserted that the claimed antibodies are useful for the 
diagnosis of cancer, particularly melanoma, based on the data in Example 18. This utility is far 
beyond the nebulous expressions "biological activity" or "biological properties" rejected in In re 
Kirk, 376 F.2d 936, 153 U.S.P.Q. 48 (C.C.P.A. 1967). Like Nelson, Cross, and Fujikawa, 
Appellants have asserted a utility which relies on a reasonable correlation between the data 
disclosed in the application and the asserted utility. The fact that there may be limited evidence 
that the correlation is not exact does not invalidate Appellants' showing of utility since the 
correlation need not be a rigorous or exact one. Considering the relevant evidence as a whole, 
Appellants have provided sufficient evidence to establish a reasonable correlation between 
changes in the level of mRNA and corresponding changes in the level of the encoded 
polypeptide. Therefore the claimed antibodies have a practical utility as diagnostic tools for 
melanoma. 
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10. Groupins of Rejected Claims 

For the purposes of the utility rejection under 35 U.S.C. §101, Claims 1-5 can be 
considered as a group. 

11. Utility - Conclusion 

Appellants' asserted utility for the claimed antibodies as diagnostic tools for cancer 
corresponds in scope to the subject matter sought to be patented and therefore "must be taken as 
sufficient to satisfy the utility requirement of § 101 for the entire claimed subject." In re Langer, 
503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). The Examiner's unsupported 
arguments and references are not sufficient evidence to make a prima facie showing that "one of 
ordinary skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 
1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). 

And even if the Examiner has established a prima facie case, Appellants have offered 
sufficient rebuttal evidence in the form of expert declarations and references, which, when 
considered as a whole, establish that it is more likely than not that the asserted utility is true. See 
In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P,Q.2d 1443, 1444 (Fed. Cir. 1992) (stating that the 
evidentiary standard to be used throughout ex parte examination in setting forth a rejection is a 
preponderance of the evidence, or "more likely than not" standard); M,P.KP, at § 2107.02, part 
VII ("evidence will be sufficient if, considered as a whole, it leads a person of ordinary skill in 
the art to conclude that the asserted utility is more likely than not true .") (emphasis in original). 

Finally, the courts' decisions in similar cases make clear that the evidence provided by 
Appellants is sufficient to establish the asserted utility. The evidence does not need to be direct 
evidence, nor does it need to provide an exact correlation between the submitted evidence and 
the asserted utility. Instead, evidence which is "reasonably" correlated with the asserted utility is 
sufficient. See Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order 
to establish practical utility; 'reasonable correlation' suffices"); Cross^ 753 F.2d at 1050 (same); 
Nelson^ 626 F.2d at 857 (same). Considering the evidence as a whole in light of the relevant 
cases, the Board should find that Appellants have established at least one specific, substantial, 
and credible utility, and the Examiner's rejection of the pending claims as lacking utility should 
be reversed. 
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C. Enablement Rejection - Detailed Argument 

The second issue before the Board is whether Appellants have enabled the pending claims 
such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected Claims 1-5 under 35 U.S.C. §112, first paragraph, arguing that because 
the claimed invention is not supported by either a specific or substantial asserted utility or a well- 
established utility, one skilled in the art would not know how to use the claimed invention. See 
Final Office Action at 7. 

The Examiner also asserts that the references discussed above demonstrate that it is 
unpredictable whether there is a correlation between mRNA and protein. 

Furthermore, the Examiner also asserts that "Pennica demonstrates that each gene 
amplification and correlation to protein overexpression needs to be determined by a case by case 
basis because even Pennica's three gene expression do not correlate and one skill in the art would 
not disregard the data in a paper just because it did not confirm ones theory." Final Office Action 
at 8. 

The Examiner asserts that "the predictability of protein translation and its possible utility 
as a diagnostic are not necessarily contingent on the levels of mRNA expression due to the 
multitude of homeostatic factors affecting transcription and translation. Therefore, absent 
evidence of the protein's expression including the correlation to a diseased state, one of skill in 
the art would be unable to predictably use the polypeptides in any diagnostic setting without 
undue experimentation. In view of the lack of guidance, lack of examples, and lack of 
predictability in the art and using the myriad of derivatives encompassed in the scope of the 
claims, one skilled in the art would be forced into undue experimentation in order to practice the 
broadly claimed invention." Final Office Action at 8. 

In addition, in the Advisory Action mailed September 21, 2005, the Examiner cites the 
statement in Hanash et aL (Nature Reviews, Applied Proteomics Collection, pp. 9-14, March 
2005) that "[t]here is a need to profile gene expression at the level of the proteome and to 
correlate changes in gene-expresion profiles with changes in proteomic profiles. The two are not 
always linked-numerous alterations occur in protein levels that are not reflected at the RNA 
level." The Examiner also asserts that the Hanash reference teaches that tumors are complex 
and that analyses must be conducted at both the genomic and proteomic levels. 
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In the Advisory Action the Examiner cites another reference by Hanash et al (The 
Pharmacogenomics Journal, 3(6):308, 2003) as teaching that "there is a need to assay protein 
levels and activities and numerous alterations may occur in proteins that are not reflected in 
changes at the RNA level." Advisory Action at 2. The Examiner also states that "[t]he literature 
supports that RNA expression cannot inevitably be correlated with levels of the encoded 
polypeptide and one skilled in the art would not assume that the levels of RNA are predictive of 
the levels of the encoded polypeptide given the distinct regulation of transcription and 
translation." Advisory Action at 2. 

1, Enablement ~ Lesal Standard 

An application enables the claims "if one skilled in the art, after reading the[] 
disclosure^, could practice the invention claimed ... without undue experimentation." Chiron 
Corp. V. Genenteck Inc., 363 F.3d 1247, 1253 (Fed. Cir. 2004). "But the question of undue 
experimentation is a matter of degree. The fact that some experimentation is necessary does not 
preclude enablement; what is required is that the amount of experimentation 'must not be unduly 
extensive.'" PPG Indus., Inc. v. Guardian Indus., Corp., 75 F.3d 1558, 1564 (Fed. Cir. 1996) 
(quoting Atlas Powder Co. v. E.I DuPont de Nemours & Co., 750 F.2d 1569, 1576 (Fed. Cir. 
1984)). 

While the application must enable one of ordinary skill in the art to practice the full scope 
of the claimed invention, "[t]hat is not to say that the specification itself must necessarily 
describe how to make and use every possible variant of the claimed invention, for the artisan's 
knowledge of the prior art and routine experimentation can often fill gaps, interpolate between 
embodiments, and perhaps even extrapolate beyond the disclosed embodiments, depending upon 
the predictability of the art." AK Steel Corp. v. Sollac, 344 F.3d 1234, 1244 (Fed. Cir. 2003). 

2. Enablement - Burden of Proof 

In order to make an enablement rejection, the PTO has the initial burden to establish a 
reasonable basis to question the enablement provided for the claimed invention. See M.P.E.P. § 
2164,04. A specification teaching how to make and use the claimed subject matter must be taken 
as being in compliance with the enablement requirement unless there is a reason to doubt the 
objective truth of the statements contained therein which are relied on for enabling support. Id. 
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It is incumbent for the PTO "to explain why it doubts the truth or accuracy of any statement in a 
supporting disclosure and to back up assertions of its own with acceptable evidence or reasoning 
which is inconsistent with the contested statement." Id, (quoting In re Marzocchi, 439 F.2d 220, 
224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 1971). This can be done "by making specific findings of 
fact, supported by the evidence, and then drawing conclusions based on these findings of fact." 
Id 

3. Enablement - Standard of Proof 

Once the examiner has weighed all the evidence and established a reasonable basis to 
question the enablement provided for the claimed invention, the burden falls on the applicant to 
present persuasive arguments, supported by suitable proofs where necessary, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. See 
M.P,E.P. § 2164.05. "The evidence provided by applicant need not be conclusive but merely 
convincing to one skilled in the art." Id. (bold emphasis added, underline in original). "A 
declaration or affidavit is, itself, evidence that must be considered ." Id. (emphasis in original). 

The examiner must then "weigh all the evidence before him or her, including the 
specification and any new evidence supplied by applicant with evidence and/or sound scientific 
reasoning previously presented in the rejection and decide whether the claimed invention is 
enabled." Id, "The examiner should never make the determination based on personal opinion. 
The determination should always be based on the weight of all the evidence." Id. (emphasis in 
original). 

4. Appellants* Specification Teaches How to Make and Use the Claimed Subject 
Matter 

The claimed subject matter relates to antibodies which specifically bind to the 
polypeptides of SEQ.ID NO: 14. The specification discloses how to make the polypeptide of 
SEQ ID NO: 14, for example in Paragraphs [0283]-[0309], [0313]-[0315] and Examples 6-9 of 
the specification. Methods for making and testing antibodies for specificity are well known in 
the art, and are disclosed in the specification, at Paragraphs [0361]-[0405] and Paragraph [0493]. 
See also In re Wands, 858 F.2d 731 (reversing the Board's decision of non-enablement and 
holding that as of 1980, undue experimentation was not required to make high-affinity 
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monoclonal antibodies to a target peptide). Methods for using the antibodies as diagnostic agents 
are well known in the art and are disclosed in the specification at Paragraph [0407]. In particular, 
the specification discloses use of the antibodies in diagnostic assays to detect differential 
expression. See Specification at Paragraph [0407]. Antibodies are also discussed in Paragraphs 
[0225], [0238]-[0247] and [0310]-[0312]. In light of the differential expression of mRNA 
encoding the PRO 1864 polypeptide in melanoma, one of skill in the art would expect the 
PR01864 polypeptide to be differentially expressed in these tumors as well. Therefore, given the 
teaching in the specification on how to make and use the claimed antibodies to detect expression 
of PRO 1864 in specific tissues, one of skill in the art would be enabled to practice the claimed 
invention without undue experimentation. 

Because Appellants' specification teaches how to make and use the claimed subject 
matter, it must be taken as being in compliance with the enablement requirement unless there is a 
reason to doubt the objective truth of the statements contained therein which are relied on for 
enabling support. See M.P,KP, §2164.04. Id 

5. The Examiner Arsuments and Appellants * Responses 

The PTO has the initial burden to establish a reasonable basis to question the enablement 
provided for the claimed invention. See M,P,KP. § 2164.04. It is incumbent for the PTO "to 
explain why it doubts the truth or accuracy of any statement in a supporting disclosure and to 
back up assertions of its own with acceptable evidence or reasoning which is inconsistent with 
the contested statement." Id (quoting In re Marzocchi, 439 F.2d 220, 224, 169 U.S.P.Q. 367, 
370 (C.C.P.A. 1971). This can be done "by making specific findings of fact, supported by the 
evidence , and then drawing conclusions based on these findings of fact." Id. 

The Examiner has rejected Claims 1-5 under 35 U.S.C. §112, first paragraph, arguing that 
because the claimed invention is not supported by either a specific or substantial asserted utility 
or a well-established utility, one skilled in the art would not know how to use the claimed 
invention. See Final Office Action at 7. 

As discussed above, Appellants maintain that the claimed antibodies possess utility. 

The Examiner also asserts that the references discussed above demonstrate that it is 
unpredictable whether there is a correlation between mRNA and protein. As discussed above, 
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Appellants have shown that a close review of Gokman-Polar indicates that with one exception, 
the trend in the data is that mRNA and protein levels are positively correlated, supporting 
Applicants assertion that increased mRNA levels correlate with increased protein levels. 
Furthermore, Appellants have shown that the Haynes and Gygi references are simply not relevant 
to the issue of whether a change in mRNA levels for a single gene leads to a corresponding 
change in the level of the encoded protein. Accordingly, Appellants maintain that the references 
cited by the Examiner do not contradict Appellants' position that, in general, differential mRNA 
expression correlates with differential expression of the encoded polypeptide. In contrast to the 
Examiner's position. Appellants maintain that the declarations of Grimaldi and Polakis, the 
accompanying references, and the excerpts and references discussed above all establish that the 
accepted understanding in the art is that there is a reasonable correlation between changes in gene 
expression and changes in the level of the encoded protein. 

The Examiner also asserts that "Pennica demonstrates that each gene amplification and 
correlation to protein overexpression needs to be determined by a case by case basis because 
even Pennica's three gene expression do not correlate and one skill in the art would not disregard 
the data in a paper just because it did not confirm ones theory." Final Office Action at 8. 

Appellants begin by clarifying that the data concerning the differential expression of the 
PRO 1864 gene presented in Example 18 relate to gene expression, not gene amplification . The 
description of Example 18 makes clear that the results were obtained by quantitative PGR 
amplification of cDNA libraries. It is well known in the art that cDNA libraries are made from 
mRNA, and reflect the level of mRNA for a particular gene in the source tissue. Thus, Example 
18 is reporting a measure of the expression of the PRO 1864 gene, i,e, mRNA levels, not its 
amplification, i.e. the number of copies of the PRO 1864 gene in the genome. 

For this reason, the relationship between gene copy number and the level of gene 
expression, i.e, mRNA levels, is not relevant to the instant application. The distinction between 
gene copy number and gene expression has been clarified for the Examiner on numerous 
occasions. 

In addition to noting the PTO's confusion between an increase in copy number of a gene 
or gene amplification on the one hand, and increased expression of a gene or mRNA expression 
on the other, Appellants further note that Pennica's results relating to the observation that the 
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WISP-2 gene DNA was amplified in colon tumors, but its mRNA expression was significantly 
reduced in the majority of tumors compared with the expression in normal colonic mucosa from 
the same patient may not even be real. In particular, the authors of the Pennica reference explain 
that: "Because the center of the 20ql3 amplicon [of which WISP-2 is a part] has not yet been 
identified, it is possible that the apparent amplification observed for WISP'2 may be caused by 
another gene in this amplicon." Pennica at 14722 (emphasis added). 

However, even if the lack of correlation between DNA copy number and mRNA level in 
Pennica is real, Appellants reiterate that Pennica says nothing about a lack of correlation between 
the level of mRNA and the level of protein expression - Pennica did not even look at protein 
expression. It is the correlation between mRNA level, as assessed by probing the cDNA library, 
and the level of protein expression which is at issue here, not the correlation of gene copy 
number and mRNA levels. 

The Examiner asserts that "the predictability of protein translation and its possible utility 
as a diagnostic are not necessarily contingent on the levels of mRNA expression due to the 
multitude of homeostatic factors affecting transcription and translation. Therefore, absent 
evidence of the protein's expression including the correlation to a diseased state, one of skill in 
the art would be unable to predictably use the polypeptides in any diagnostic setting without 
undue experimentation. In view of the lack of guidance, lack of examples, and lack of 
predictability in the art and using the myriad of derivatives encompassed in the scope of the 
claims, one skilled in the art would be forced into undue experimentation in order to practice the 
broadly claimed invention." Final Office Action at 8. 

Appellants acknowledge that gene expression is regulated at numerous levels. However, 
as discussed above, the Declarations and supporting references supplied by Appellants make it 
clear that regulation of mRNA levels is the predominant mechanism of control for the majority of 
genes. 

With respect to the Examiner's assertion that the PRO 1864 polypeptide has not been 
correlated to a disease state. Appellants maintain that, in view of the differential expression of 
the PRO 1864 polypeptide, one of skill in the art would be able to use the claimed antibodies as 
diagnostic tools regardless of whether or not the PRO 1864 polypeptide is the causative agent of 
melanoma. 
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As discussed above, the specification teaches how to make and use the claimed 
antibodies. Furthermore, as discussed above, methods for making and testing antibodies for 
specificity and methods for using the antibodies as diagnostic agents are well known in the art. 
Accordingly, Appellants maintain that the specification enables one skilled in the art to make and 
use the claimed antibodies without undue experimentation. 

In addition, in the Advisory Action mailed September 21, 2005, the Examiner cites the 
statement in Hanash et al (Nature Reviews, Applied Proteomics Collection, pp. 9-14, March 
2005) that "[tjhere is a need to profile gene expression at the level of the proteome and to 
correlate changes in gene-expresion profiles with changes in proteomic profiles. The two are not 
always linked-numerous alterations occur in protein levels that are not reflected at the RNA 
level" Advisory Action at 2. The Examiner also asserts that this Hanash reference teaches that 
tumors are complex and that analyses must be conducted at both the genomic and proteomic 
levels. 

The Examiner also cites another reference by Hanash et al (The Pharmacogenomics 
Joumal, 3(6):308, 2003) as teaching that "there is a need to assay protein levels and activities and 
numerous alterations may occur in proteins that are not reflected in changes at the RNA level." 
Advisory Action at 2. The Examiner also states that "[t]he literature supports that RNA 
expression cannot inevitably be correlated with levels of the encoded polypeptide and one skilled 
in the art would not assume that the levels of RNA are predictive of the levels of the encoded 
polypeptide given the distinct regulation of transcription and translation." Advisory Action at 2. 

Appellants acknowledge that gene expression is regulated at numerous levels. However, 
as discussed above, the Declarations and supporting references supplied by Appellants make it 
clear that regulation of mRNA levels is the predominant mechanism of control for the majority of 
genes. Therefore, Appellants maintain that there exists a well known and accepted correlation 
between mRNA levels and protein levels. Given this correlation, the specification fully enables 
one of ordinary skill in the art to make and use the claimed invention. 

L Groupins of Rejected Claims 

For the purposes of the enablment rejection under 35 U.S.C. §112, Claims 1-5 can be 
considered as a group. 
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2. 



Conclusion 



In view of the arguments presented above, Appellants maintain that the specification 
enables one skilled in the art to make and use the claimed antibodies. Accordingly, Appellants 
respectfully request withdrawal of the rejection under 35 U.S.C. §112. 

D. Conclusion 

In view of the arguments presented above, Appellants submit that the specification as 
filed provides a specific, substantial and credible utility for the claimed antibodies and request 
withdrawal of the rejection under 35 U.S.C. §101, and the related rejection under 35 U.S.C. 
§112, first paragraph. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 11-1410. 



Respectfully submitted. 



KNOBBE, MARTENS, OLSON & BEAR, LLP 





Daniel Hart 

Registration No. 40,637 
Attorney of Record 
Customer No. 30,313 
(619) 235-8550 
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APPENDIX A - Claims on Appeal 

1 . An isolated antibody that specifically binds to the polypeptide of SEQ ID NO: 14. 

2. The antibody of Claim 1 which is a monoclonal antibody. 

3. The antibody of Claim 1 which is a humanized antibody. 

4. The antibody of Claim 1 which is an antibody fragment. 

5. The antibody of Claim 1 which is labeled. 
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APPENDIX B- Evidence 

Attached hereto is a copy of the evidence cited in Appellants' Brief. The list of evidence 
below is accompanied by a statement setting forth where in the record that evidence was entered 
into the record by the Examiner. 



Tab 


Reference 


Submitted 


Entered 


1 


T T ^ f ✓ T T» J. 

Hu et al. (J. Proteome 
Res., 2003) 2(4):405-12 




Cited by Examiner in the first 
Office Action 


2 


Haynes et al. 
(Electrophoresis, 1998) 
19(11):1862-1871 




Cited by Examiner in the first 
Office Action 


3 


Pennica et al (PNAS, 
1998) 95:14717-14722 




Cited by Examiner in the first 
Office Action 


4 


Gokmen-Polar et al 
(Cancer Research, 
2001)61:1375-1381 




Cited by Examiner in the final 
Office Action 


5 


First Declaration of J. 
Christopher Grimaldi 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 1 




6 


Second Declaration by 
J. Christopher Grimaldi 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 2 




7 


Declaration of Paul 
Polakis, Ph.D. 


Originally Submitted on 

April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 3 




8 


Bruce Alberts, et al 
(Molecular Biology of 
the Cell, 3'^ed. 1994) 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 4 




9 


Bruce Alberts, et al 
(Molecular Biology of 
the Cell, 4'*^ ed. 2002) 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 





Response to Office Action as 
Exhibit 5 
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10 


Benjamin Lewin, 
Genes VI (1997) 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Kesponse lO wiiicc /vciion as 
Exhibit 6 


11 


Zhigang et al (World 
Journal of Surgical 
Oncology, 2004) 2:13 


Originally Submitted on 

April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 7 


12 


Meric et al. (Molecular 
Cancer Therapeutics, 
2002) 1:971-979 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 8 


13 


Gygi et al. (Molecular 
and Cellular Biology, 
Mar. 1999) 1720-1730 


Originally Submitted on 

April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 9 


14 


U.S. Pat. No. 6,156,500 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 10 


15 


U.S. Pat. No. 6,562,343 


Originally Submitted on 
April 29, 2005 with 
Appellants' Amendment and 
Response to Office Action as 
Exhibit 1 1 



f 
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There are no decisions rendered by a court or the Board in any related proceedings 
identified above. 
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High-throughput technologies, such as proteomic screening and DNA mkro-aaays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated titerature-mlning tool, termed MedGene. wtilch 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships In Medline. Usir^g IVledGene. we analyzed a novel nnlcro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of exIsUng knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold: however, a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an ex(^essfon difference of 10-fold or more, 
tmerestingty. this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relaUvely understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 



Keywords: biotnformatics • micfo-array * text mining ♦ gene*tfisease association • t>reast cancer 



Introduction 

At its current pace, ihe accumulalion of biomedical lileraiurc 
outpaces the ability of most researchers and clinicians lostay 
abreast of Uiclr own immediate Oelds. let alone cover a broader 
range of lofto. For cxaitipte. to follow a single disease. c.g., 
breast cancer, a researdier would liave had to scan 130 difTcrcnt 
journals and read 27 papcn per day in 199E1.' This problem Is 
accentuated wJUi hlgh-Uu-oughput leclvralogjcs sucU as DNA 
micro-arrays and prolcomics. which require the analysis of 
large dalascts Involving tlwusands of genes, many of which arc 
unfamiliar to a particular re5ean:her. In any microarray experi- 
ment, tliousands of genes may dcinonstralc statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret Uiese dalasets 
would be enhanced if they could be compared to a compre- 
hensive summary of wliat is known about ail genes. Thus, there 
is a need to summarize existing knowledge In a format that 
allows for Uic rapid analysis of associations between genes and 
diseases or other specific biological coiKcpls. 

One solution to this problem is to compile structured digital 
resources, sucli as the Breast Cancer Ccne Database' and the 
Tumor Gene Database.' I-lowcvcr. us these resources arc tiand- 
curaicd. tlic labor-intensive review process becomes a rate- 
limiling step in tlie growth of IJk* database. As a result, these 
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databases have a limited scale and titc genes arc not selected 
in a systematic Tashlon. 

An alternative approach is automated text mining: a method 
wiiich involves automated infonnaUon exUacUon by searching 
documents for text strings and analyzing their frequency and 
context This approach has been used successfully In several 
instances for biological appUcatlons. In most cases. It has been 
applied to extract information about the relaUoruhlps or 
intcracUons that proteins or genes l>ave witli one another, in 
die lileraiurc or by funcUonal aTmotaUon.''^ Thus far, few 
publicaUon have applied text-mtning to examine the global 
rclalicnships between genes and diseases. Perez-Iratxela el al. 
auloinaticalfy examined iJie CO (Ccne Ontology) annotation 
of genes and Ihclr predicted chromosomal locaUons In order 
to Identify genes linked to inhedtcd disorders.* 

To obtain a more global understanding of disease develop- 
ment. It would be valuable to Incorporate information regarding 
all possible gcrw'dlsease relationships, including bioclicmical. 
physiological, pharmacologlcat. epidemiological, as well as 
genetic This Information would enable comprehensive com- 
jurlsons between large experimental dalascts and existing 
knowledge in the literature. Tliis would accomplish two things. 
First, il would serve to validate exf^erlments by demonstrating 
Uiat knoivn responses occur as predicted. Scxond. il would 
rapklly highlight wlilch genes arc corroborated by the literature 
and which genes are novel in a given context We liavc utilized 
a conipuUUonal approach to literature mlnitig to produce a 
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comprehensive sot ofgene-dbeasc relaUonslilps. In addtdon» 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
dtallon. We applied this toot to help interpret the data trom a 
large micnHarray gene expression experiment comparing 
nonnal and cancerous breast tissue. 

Melhods 

MedCene Database. MedCene is a relational database, stor- 
ing disease and gene isirorination from NCBI. text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. McdGcne has a \veb-based user Interface for users to 
query tlie database (http^/hlpseq jnedJiarvaixl.edu/MedCenc/). 

Text Mining Algorithms. MeSH Dies were downloaded from 
the MeSH web site at hUM (NaUon Library of Medicine) (http^/ 
www.nlmj^.gov/mesh/meshhomcJitiiU) arxl human disease 
categories were selected LocusUnlc files were downloaded irom 
the LocusLfaik web site at NCBI (hUp7/www.ncbl. nlh.gov/ 
LocusLlnk/). OPDcial/preferTcd gene symbol, official/preferred 
gene name, and gene allemaiive symbols and names, all 
relevant annotations and tJRLs for each LocusUnk record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gcj^ 
family terms and gene ofDdal/preferred symbols, were used 
to index Medline records. If ihc oilkial/preferred gene symt»ols 
did not meet the standards to be an index, then qualifled gene 
oflicial/prefcrred names were used. A local copy of Medline 
records (up to July. 2002) was pre>selccicd. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record wiUi the appropriate disease tcnns. A 
separate JAVA itiodulc was usckl to examine die titles and 
absu^cts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were Indexed for both gene and disease 
(double positive hits), for disease only (disease single lilts), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency tat>tc-framcwork. we applied 
different statistical metliods to estimate the strengtlt of genc- 
discasc relationships asid evaluated the results. Ttiese methods 
included chl-square analysis. Plsher*s exact probabilities, rela- 
tive risk of gene, and relative risk of disease" (http:// 
hipscq.med.harvardjedu/McdCcne/). In addition, we computed 
the "product of frequency', which Is the product of the 
proportion of disease/gene double hits to disease single hits 
and Ihe proportion of dlseasc/gcnc double hits to gone single 
hits. To obtain a normal distribution, we ti^nsfonmed ail the 
statlsUcal scores using Ihc natural logariUun. We selected the 
log of the produa of frequency (LPF) to vaUdate MedCene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation cocfTidenls were used to assess the lUicar 
relationship between I.PF and micro-array fold change in 
expression level. 

Global Arkalysis. Diseases wltJi at least 50 related genes were 
selected for clustering .analysis, and tlie LPF scorcis were 
nonnalizcd wiUi total score for each disease. Illcmrchical 
clustering was done with the •Cluster" software and the 
clustering result was visualized using 'TreeVicwcr" (http:// 
rana.lbl.gov/FJsGnSortwarc.htni). 
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Breast Tissue MIcro-Anays. Elgh^-nlne breast cancer 
samples (79% ER-positlve) and 7 normal breast tissue samples 
were selected from the Harvaid Breast SPOI^ frozen tissue 
repository aiui were representative of the spectrum of histo- 
logical types, grades, and hormone receptor Immuno-pheno- 
types of breast cancer. Blotlnylated cRNA. generated from the 
total RNA extracted from the bulk tumor, was hyk>ridized to 
Afl^etrix U95A ollgo-nudeotlde micro-arrays. Ihese mlao- 
arrays consist of 1 2 4(M) probes, which represent approximately 
dO(X) genes. Raw expression values were obtained using C£NE> 
CHIP software from AHymclrlx, and then further analyzed using 
the DNA-(^p Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene>disease assodaUons in the IlterBture, 
we first compQcd complete lists for human diseases and human 
genes. To index aU Medline records that were relevant to 
human diseases, tiie Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH Is a controlled medlcai 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are an^ged In both 
an alphabetic and an hierarchical structure. Medlir)e records 
are reviewed manually and MeSH terms are added to each with 
software assbtance.* *^ Twenty-three human cfiscase category 
headings along with all of their child terms (see the Supporting 
Ir^ormation. Supplemental Table I. or visit http://hlpseq. 
mediiarvard.edu/MedGene/pubIicalion/sJTable I Jiiml) were 
selected from the 20D2 MeSH index creating a Osi of 4033 
human diseases. 

No index comparable to the MeSH Index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found In Medline tcxL A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were coUecled from the LocusUnk database at 
NCBL'» " which contains 53 259 independent records keyed 
by an ofllcial gene symbol or name (June IBfi*, 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at aU. 
For tlie intended use of this study, this lack of distinction Is 
unlikely to have a large effect and may In fact be beneficial. 

Initial attempts to search Ihc Utcraturv using these lists 
revealed several sources of false posIUves and false negatives 
(Table 1). False posillves primarily arose when the searched 
lenn had other meanir^s. whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily harxlled 
by including altemate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
nicanlngs for the terms, were more diflkult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rale but unavoidably undcr-n^prcscnting some 
genes. 

For ihe purposes of data tracking, a primary gc)Wi key was 
selected to represent dU synonynis that correspond to each 
gene. Medline rcconls were indexed with a primary gene key 
wlien any synonym for that key was found in llic llUc or 
abstract Casc-insensitive string mapping was used for alt 
searches except as note<t above. No additional weight was 
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Table 1. Systematic Sources of False PoslUves and False Negalfves in Unfiltered Data* 



sotnceoreiror 



I type 



laicrcohitioii 



gene symbol/name 
Is not unique 



gene j^znbol is 

unrelated abbieviailon 
gene symbol/name 

has language meaning 
nonstandard syntax 
unoflicial gene name/;^mbol 
nonspedOed gene name 



false positive MAQ-tayeita 

assodaied ^ycoproteln 
A^O-malignancy-assodated 
protein 

fake positive /M^p^Ud homologue (mouse). 

pallldin (also abbrev. for Penn^lvanla) 
false positfive tVA9->Wlskott-Aldridi Syndrome 

(also the wofd >wa$") 
false negaUve BAG-I Instead of BAG I 
false negative F£9tns(ead oT 7753 
false negative estrogen receptor instead of 

Estrogen receptor 1 



eliminate this tenn 

eliminate this tetm 

case-sensitive siring search 

add dash term 

add all gene nicknames 

add fiunlly stem term 



• In prcBmlnaiy sludtes^ Medline was xeorchcil for co -occurrence of genes nnd dbeases and the resuIUng output vras cvalualed to IdcnUfy eiTor sources Ibat 
were amerwblo to (potul flUeis. Bach error source is categorized by the ^pe of enor It causes fiibe positives ere suggested rebtlonshtps thai are not real and 
fabe n^tflMS are real rdaUon^ps that are undetiepresenUd.Tbe filler solullons used arc Indicated. Note that In seme cases, the filler sohiUon Itselflfitroduccs 
error. In general, error rates martwlifd sensIIM|y. even at the expense of spediidty If iweded. 



added for mtriUple occunenoes of a term or the co*oocurrence 
of imiUJpIc synonyms for the same gene k^. 

Medline records were searched with all quaODed gene 
Idenlifleis* sucii as the otncbl/preTcfTed gene symbol, the 
ofndal/preferred gene name, all gene nidcnamcs arid all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer lo use a 
shortened gene &mity name» e.g.. estrogen receptor Instead of 
estrogen receptor 1 (£53?i)* creating a source of false rwgalivcs. 
For this reason, gene family stem terms were created for all 
genes tl^at have an alpha or numerical sulQx (e.g.. JL2RA, TCFfi, 
ESRi, etc.) and then used to search the literature. The family 
stem terms were handled separately from the spedlk gene 
names so tiiat it would be dear when linkages were made to 
the gene family versus a q^cdDc member In that family. 

To improve performance and accuracy, some pre-seleciion 
was applied to the records that were scarmed. Brst review 
articles were eliminated to avoid redundant treatment of 
dtations. Second. non-EngUsh^umaU were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely lo contain primary data 
about gene-disease relationships were also removed (e.g.. Int. 
J. Health Educ., Bedside Nuise, and / HeaJtit Eton.). Together, 
these niters reduced the 12 198 221 Medline publications (July 
2002) by 37%. 

Ranking the Relative Strengtiss of Gcnc-Diseasc Assoda* 
tions. In tolal, there were 618 708 gene-disease co-citations, 
in which 16% (9297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases liad been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With ilie exceptton of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the genc-dlsease association 
strengths. However, after comparing the results to otiicr 
databases and after consulting disease experts, the log of Uie 
product of frequency (LPF) was selected for further analysts 
because It gave the best results overall. 

Validation of MedGcnc. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) arKi miscalled genes (fake positives). However. In 
s)tuatk>ns when these goals were in connict. indusivcncss was 
prioritized. To delcrmtne tlie fabe negative rale in MedCcne. 
breast cancer was used ns a lest case because It was assodatcd 
with more genes tlian any oilier human disease and because 




Figure 1. EsttmaKon of the false negative rate by comparison 
with handHiuraied databases. The txeasi cancer-related genes 
Identified by MedGene were compared with those listed In 
several other datat)ase5 including the Tumor Gene Database 
(TGO).' the Breast Cancer Gene Daiabase(BCG).* GeneCards 
(GC)^' and Swissproc.^* Genes were considered false negatives 
if they were represented in al least one of these other databases 
and not in MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. Alt literature references 
were verified by manual review lo confirm their validity. The 
number of genes in each datat>as8 or shared by more than one 
database is indicated. The false rwgative rate was calculated by 
genes missed at MedGene (?6)/toial number of rKxioverlapping 
genes in other databases (285). 



Uiere were several publk databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer-rclaicd genes that were 
supported by at least one Uierature ciUb'on in these hand- 
curaicd databases. 26 were absent from MedGene. suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for Uiesc genes (80 
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papers) were reviewed manually (see Ihe Suj^poftinig Infoima- 
tion* SupplemerUal Table 2. or visil hti^://l^pseq.ineiL 
harvanLedu/MedCene/pubUcadon/sJTable 2.htin0. Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by cor spedfidty filters. 
Few genes were missed because th^ were only mentioned in 
review papers (0.4%) or they appeared only In the body of (he 
roanuscr^t but not the abstract or tlUc (1.1%). Of note. 
MedGene Identifled approximately 2000 additional breast 
cancer-related geries not listed In any other database. 

To assess the positive enror rate, two complementary 
approaches were used: a detailed analysis of one dbease and 
a ^obal examination of 1000 diseases. The detailed approach 
examined the false positive error rate and Its sources. «^reas 
the global approach tested whether the overall residts ntade 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked ZOO genes and manually reviewed the tlUes and 
at>stfacts to determine the verily of tlie assoclatlorL Nearfy 80% 
of the highest ranked 100 genes fell Into one of the Ove 
categories that reflect meaningful gcne-dlsease relationships 
(see the Supporting Information, Supplemental Table 3, or visit 
hltp://hipscq.med.harvard.edu/MedCenc/ publication/ 
sJTable 3,html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, tiiere were only two In which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the autliors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer.*' " 

The coincklence of some gene symt>ols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
Information. Supplemental Table 4. or visit http^/hlpse- 
q.med.harvani.edu/MedGene/publicatJon/sjrablc 4.htm0. ern- 
pliaslzing the importance of the filters that were added in the 
search algorithm fTable 1). Without the filters, the false positive 
rale more than doubled, and the false negative rale rose 
draiTiatically (data not shown). For example, among the papers 
about breast carKcr. there were only 12 Medline records that 
referred lo ESRI and 10 to whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESRl or ESR^t 
this latter group was detected by the family stem tcmi Oiler. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedCene was performed. 
For this experiment. It was reasoned that Ihe more closely 
related tlie diseases arc to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedCene accurately reflected the literature, tlien an unsu- 
perviscd hlcrarclUcal clustering of the gene data should group 
diseases In a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically. It may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment. tl\e gene sets and the corre- 
sponding LPF values for 1000 rar>domIy selected diseases (eaci) 
with al least 50 gene relatioruhlps) were used as a datasct for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were bidced logical based upon 
common medical knowledge (sec the Supporting Infonnatioj). 
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Supplemental Figure 1. orvlsit hi^/hipseq jnedharvardedn/ 
MedCene/publicalion/s.J]gure liktmO* For example. In one 
such duster shown In Figure 2. diabetes and its complications 
grouped ti>gether btkI were also closely linked to diseases 
associated with starvation states. ■ 

The number of genes associated with a given disease can. 
be estimated fay atlftislitig the MedGene number up by ihefidse 
negatNe rate ('•O^ and down by the false positive rale ('^26% 
on average). Using this, the average disease has 103.7 i: 45.3 
(mean ± sjd.) gcniss associated with It, although the range Is 
quite broad with 2359 genes related to breast cancer, 2122. 
genes related to lung caiKer and no genes related lo a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive sumonaiy of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment We compared the MedCene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
rK)rmal breast tissue samples. Micro-array artalysis Identified 
2286 genes that had greater than a 1-fold dilference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedCene. we sorted the 22B6 genes into 
four classes: 555 genes directly linked to breast carKer In the 
literature by gene term search (flrst'degrec association by gene 
name): 328 genes directly linked by family term search (first- 
degree association by family term): 1021 genes linked to breast 
cancer only through other breast cancer ger>es (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information. Supplemental 
Figure 2. or visit http://hipseqjned.harvard.edu/MedCene/ 
publicallon/sj^gure 2.html.) Among the 505 previously un- 
related genes, 467 were either newly Idcntincd genes or genes 
Uiat had not previously been associated wilii any disease. 
Among the remaining 38 genes. 9 had been related to other 
cancers. spedAcally esophageal, colpa uterine, skin, and cervix. 

To determine whether the gcrtcs highlighted by the micro- 
anay analysis were more likely to have been previously linked 
to breast cancer In the literature, we created a two-dimensional 
plot of the fold change of expressfon krvel tietwecn breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes direct^ linked to breast cancer rariging from 
less than 1-fold char^ (68%) to over 40.fold (0.3%). i^taUy. 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by flrst-dcgree assoda- 
don. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r= 0.018, p-value = 0.62). However, when 
we stratified the analy^ based on the magnitude of the fold 
d^ngc. we observed an increasing trend in correlation (Figure 
38) suggesting tliat genes with a more substantial change In 
expression level were more likely to have a stronger association 
In tlic litcrTiture. For genes that had 10-fold change or more In 
exprcsston level, tlie correlaikm increased to 0.41 (p-value » 
0.0S). 

When we evaluated the micro- array data separately for ER 
positive and ER negative tumors, the trerKi In com*lation 
between fold change and literature score was higMy dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trcrKl in 
correlation for ER negative tumors. 



Anntysis of Data Using Advannd litmtun Mining 



research articles 



lHlirSillrt[Illltk,[;Sniin^liln^(^ 

iiiiiii 





CoxsftcklevlKus Xntvctloiuf 
Obesity in Piaa>ates 
Diabetic Kotoacidoslo 
Olvoose Intoleranoe 

Dlal>«t«s MellJlt\ia« Doia>Inaulin-D»p«n<l«at 
Diabotea MAXlltua« Znsul In-Dependant 
Pregnancy in DXabotlcs 
Diabetic tletlxK>patlLy 
Diabetic Angiopathies 
Diabetic Nouropafchies 
Oiycosurla 
Byperlneulinima 
Hyperinsul i neffiia 
Hypoglyceaia 
Hypergiyceccftla 

Diabetes MeXlitu6« Experimental 
• Diabetes HeXlltus 
Diabetes* Gestational 
sftaxvaCrwa 
Jaundice, Heonafcal 
Brain Bdena 
Fulxkonary Bdesia 
HutritioA Disorders 
Rwashiorkor 
Critical Illness 
Burns 

Diabetic Nephropathies 
Albuminuria 
Insulinoma 

Figure Z. Global validailon by clustering analysis. 2CA). The gene sets and the corresponding LPF values for 1000 diseases, each with 
at leasi 50 gene relationships, were used in an unsupen;ised clustering of the diseases based on the gene patterns associated with 
them A sample of the data is shown here. 2(B). One of the result^g clusters is shown that corresponds to blood sugar slates. Diabetes 
terms (atiove the line) and starvation slates terms (under the line) clustered together. Within these groups, there Is also clustering of 
diabetic small vessel complications, altered scrum chemistries, mitriUonal disorders, etc.(SuppTemenial Figure 1: http:/Ai{pseq.med. 
harvard.edurt^edGene/|XJbHcation/s.Figure 1 .himl). 




Finally, lo validate our llndings, wc computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by McdCcnc for liypcrtcnslon. a 



disease unrelated lo breast cancer. As expected, wc did not 
observe an Increasing trend in correlation for hypcrtcn* 
sion. 
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Mlcroarray Fold Change (Cancer/Normal) 




MIcroarray Fold Change 



Figure 3. Relalionshlp between literature score and functional data Tor breast cancer. 3A. The data from an expression analysis of 
samples for breast tumors arul normat breast tissue were analyzed lo Indicate the foU difference of expression level between breast 
tumor and normal sample (cutoff > 3'fold change). The fold changes were plotted against the literature score for the same gene set. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search arwl red 
dots represent no-associatlon. Some welt-studied genes, such as BRCAZ (pink circle), are not reflected by a substantial difference In 
expression level. Furthermore, the majority of genes that have no association with breast cancer In the literature had less than lO-foW 
expression changes (shaded area). 3B. The Spearman rank-correlation cocfflcienis between literature score (LPF) and the fold change 
of expression level between tumor and normal breast samples ()«>axis) in relation lo the amourX of fold change of expression level 
(x-axis). Gene rank lists were generated for breast cancer (blue) and hypertension (pink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among esuogen receptor positive tumors only (light l>lue) arid 
estrogen receptor negative tumors only (purple). 
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bfcfut neoplasms 


faypert^nsfon 


rliciiinaliild aitfaritts 


blpdardisefder 


aiberosderosls 


estrogen receptor 


REN 


OA 


ERDAt 


auuiuiuiJi utciu 


PCR 


uBr 


JriraisiFivn 


CXMP90 




ERBBZ 


LEP 


CRP 


PFKL 


LDIB 


Bi^lAi 


AiiT 


AC 




ELN 


BRCA2 


JNS 




inn 


ARCI 

nnvM 


ECFR 


kallikrein 


tiLA-UliDl 




JlrVD 


CtPlS 


ACS 


not 

UKi 


H797A 


APQAt 


TFPi 


endothelln 


inienetilun 


UKUJ 


hSRi 


P5£M? 


SiOOAB 


tNF 


REM 


LPL 


7PS3 


BDK 






PTiMt 

plasminogen 


CESS 


DIANPH 


collagen 


DRD4 


activator inKtbi&xr' 


CEACAMS 


SARi 


ILIA 


HTR2C 


vascular cdl 


ERBB3 


PIH 


ACR 


RELN 


adhesion molecule 


cydin 
C0X5A 


CDSS 


TNFR^IZ 


DBH 


ATOm 


ALB 


lU 


MAOA 


VWP 


cathepsin 


CYPllBZ 


CHJJLi 


com 


itiS 


ERBB4 


MATZB 
angiotensin 


US 


HTR2A 


ARCZ 


TRAM 


receptor 


interleukin i 


SYNJI 


ABCAi 




matrix 






CCNDI 


AGTR2 


meialloprotelnase 


INPPi 


OLRI 


ECF 


NPPA 


interferon 


NEDD4L 


coDagen 


MUCi 


LVM 


CD68 


FRAI3C 
transducer of 


MCP 


Insulln-Ukc 


DBH 


iL4 


ERBBZ 


lipoprotein 


DCLZ 


NPY 


ILI7 


BAJAP3 


AP0A2 
Intercellular 


mucin 


POMC 


MMP3 


ATPIBS 


adhesion molecule 


FCF3 


neuropeptide 


SIL 


DRD5 


RAB27A 



* McdGcnc resuUs for the top 25 genes MsodaUd vf$ih breast ntopSsana, bypcileiuion. rbeitfnatoid aitbfitU. bipolar dtaofdcr. and alfaeiosclcrasls, rer^iectlv^, 
ranked by LPF scores. The hyperlink lo an the papcis co-dUng the gene and the disease Is avaUaUe at MedGene website (hitp://hIpscq.inedJunrBrd^/ 
McdCene/). 



Discussion 

The Human Ccnome Projccl licralded a new era in biolog^al 
research where the emphasis on undeislandli^ specific path- 
wajrs has expanded to global studies of genomic organization 
and biological systems. H|gh-lhroughpul technologies can 
provide novel insight into comprehensive biological function 
but also introduces new challenges. The utility of these 
Iccliiiologics is limited Lo the abUlly to generate, analyze, and 
inlerprel large gene lists. MedGcne, a relational database 
derived by mining the informaUon in Medline, was created to 
address this need. MedCene usen can queiy for a rank-ordered 
list of human gene-disease relationship fTable 2) for one or 
more diseases. Each entry is hyperllnkcd to the original papers 
supporting each association and to other relevant databases. 

MedCcne Is an Innovative extension of previous text mining 
approaches. Percz-lraUcta et al. used the CO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders.* MedCene takes a broader view 
and includes all diseases and all possible gene-disease iclatlon* 
ships. FurthernK)re. MedCene utilizes co-dtatkm to IrKlicate a 
relationship rather than CO annotation, which Is limited to the 
subset of genes that have CO annotation. Our approach is 
complcmentiiry to that taken by Chaussabel and Sher. vvIh> 
used the frequency of co-cited terms to duster genes bito a 
hierarchy of gene-gene relationships.^ 

A unique aspect of this tool is the ability to assess the rclaUvc 
strengths orgcne-dtscasc relationships based on the frequency 
of both co-citaUon and single cJtaUon. This presupposes that 
nwst co-cltations describe a poslllvc assodalion. often rcrcrrcd 
lo as publication bias" and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
htLp://liip5eqjncd.h3rvard.edu/McdGcne/publicaUon/5.Ta- 
ble SJitml). Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link; 
however, it Is strong evidence to support a true relationship. 

Another Important feature of MedCene Is the Implementa- 
tion of sofhvare filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest assocbtions were real. 
For this study, all of the fillers that we applied were general 
ones. e.g„ expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idtosyrxrallc and dlfBculL to identify systematically without 
causbtg a significant rise in false negatives. AItf:mative ap- 
proaches, such as the examination of the nearest neighbor 
tenns. need lo be considered to further reduce the false posithre 
rate. 

It is not uncommon to see expression changes In micro- 
array experiments as small as 2-fold reported in the literature. 
Even when these cjq^resslon changes are statistically significant. 
It is rK>t always dear if they are blologlcaUy meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedGcne provided a 
unique opportunity to test this notion In the context of existing 
knowledge on a novel breast cancer micro-array dataseL For 
genes displaying a 5-fold change or less In tumors compared 
lo normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table S. Genes with Large Expression Changes In ER- but 
Not f n ER-f Breast Tumors. 



gene symbol 


folii Chinee CER-*) 


fbldclian0e(ER' 


XkTHBS 


t ft 


6l0wB 


BRS3 


\Z 


RQ A 


DKKI 


\2 


CO B 
DsUI 


ZICi 


1^ 


59.6 


TLBI 


1.0 


38.5 


XIAA0680 


Z& 


33.2 


CDKN3 


IjO 


30.B 


EB12 


4J) 


27.9 


CZMB 


3^ 


Z1.9 


STKI8 


4.7 




CPR49 


1.0 • 


I4.fi 


MYOSO 


1.6 


14.4 


lADI 


-1.0 


13.5 


P0LE2 


AZ 




HMC4 


4.4 


12.9 


BCL2U! 


-1.2 


123 


UtP8 


23 


12.2 


CCNB2 


IjO 


il.a 


CCNE2 


Am 


11 R 
1 1.v 


FCB 


-4J 


11.1 


KN5L6 


2.9 


10.9 


WFS 


3.0 


10.2 


SERPINH2 


4£ 


10.2 


YAFI 


1.0 


10.0 


IPHB 


-1.3 


-10.4 


TCBAZ 


-I.I 


-10.8 


TFFI 


1.3 


-11.4 


C0LI7AI 


-4.1 


-15.7 


POF5 


i.I 


-1B.2 


BFACI 


-4.6 


-223 


• FDZKl 


-I.I 


-36.8 


VEGFC 


-2.8 


-51.5 


MUC6 


-1.4 


-64.9 


SERFJNA5 


-1.0 


-83.1 


MBSl 


-1.6 


-85J) 


CAIZ 


2.4 


-150.3 



Tal>)e 3. MetiCene iitciiiiAcd a set of tcbUvdy tindenludled, yrt highly 
expressed genes In ER negative, bui not ER posiUve breast tumors. All of 
tltesc genes have eiihcr never been co-ctted with breast cancer or have a ' 
weak assoctnlion ejtcept those maikcd with an *. 



reflects the many genes whose role In breast cancer may not 
involve large changes In expression In sporadic tumors (c.g., 
BRCAl ;u\d BRCAZ^ and genes ^vhosc modest changes In 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10>fold change or more in expression level, there 
was a suong and significant coirelaUon t>etWGen expression 
level and a published role In the disease, providing the first 
global validation of the micro-array approadi to Identifying 
diseasc-speoBc genes. 

Tlie results derived from MedCcne have two ImpUcadons. 
First, a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
less than 5-fold expression level changes. Second, any genes 
with 10-fold changes or more arc likely to be related to breast 
cancer and warrant attention. It Is likely that this threshold will 
change depending on the disease as well as tlie expcrlmcnL 

Interestingly, the observed corrdallon was only found among 
ER-positive tumors, not ER-negativc. This may reQect a bias 
in the Utetaturc to study (he more prevalent type of tumor in 
Uie population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments that may contain 
subpopulations that behave very differently. The McdCcne 
approach identified a set of relatively understudied, yet lilghly 
expressed gqiKSS in ER- negative tumors lliat arc worthy of 
furtlicr examination (Table 3). 



In conduslon, we have developed an automated method of 
summarizing and organizing the vast btomedlcal Utecature; To 
our knowledge, the resulting database Is the most comprchen- 
shre and accurate ofits kind. By geneiatb^ a score that ledects 
the strength of the association. It provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore. It can 
be used for Meeting subsets of genes for lunctional studies, 
for buikiing disease-specific anays. for looking at genes com- 
mon to mulbple diseases and various other high-throughput 
applications. In the future, it will be possible to enhance the 
utility of the MedGene database by bulldlrig links between 
genes and other MeSH terms as well as other bk>Iogical 
processes and concepts, such as cell division and responses to 
small xiudecules. 
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Review 

Paul A. Haynes Proteome analysis: Biological assay or data archive? 

Steven P. Cygl 

Daniel Figeys jj^jj review we examine the current stale of proteome analysis. There are 

Ruedi Aebersold ^yxtcq main issues discussed: why it is necessary to study proleomes; how pro- 

teomes can be analyzed with current technology; and how proteome analysis 
Department of Molecular ^^^^ enhance biological research. We conclude that proteome anal- 

Biotechnology, University of yg-^ essential tool in the understanding of regulated biological systems. 

Washington, Seattle, WA, USA Current technology, while still mostly limited to the more abundant proteins, 

enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biological 
research will continue to be enhanced by further improvements in analytical 
technology. 

Contents resolution two-dimensional gel electrophoresis (2-DE), 

detected in the gel and identified by their amino acid 

1 Introduction 1862 sequence. The ease, sensitivity and speed with which gel- 

2 Rationale for proteome analysis 1862 separated proteins can be identified by the use of recently 

2.1 Correlation between mRNA and protem developed mass spectrometric techniques have dramati- 
expression levels 1863 ^jj^ increased the interest in proteome technology. One 

2.2 Proteins are dynamically modified and pro- ^^^^^ ^^^^ attractive features of such analyses is that com- 
cessed 1863 ^^^^ biological systems can potentially be studied in their 

2.3 Proteomes are dynamic and reflect ihe entirety, rather than as a multitude of individual corapo- 
slate of a biological system 1863 ^^^^^ ^^^^^ ^ far easier to uncover the many com- 

3 Description and assessment of current pro- pj^^^ ^^^^ obscure, relationships between mature 
leome analysis technology 1863 gene products in cells. Large-scale proteome characteriza- 

3.1 Technical requirements of proteome tech- ^-^^ projects have been undertaken for a number of dif- 
nology 1863 organisms and cell types. Microbial proteome pro- 

3.2 2D electrophoresis - mass spectrometry: a j^^^^ currently in progress include, for example: Sacchaw- 
common implementation of proteome anal- ^^^^^ cerevisiae [2], Satmonella enterica (3], Spiroplasma 
ysis 1864 fnciliferum [4], Mycobacterium tuberculosis [S], Ochrobac- 

3.3 Protein identification by LC-MS/MS, capil- anthropi [6], Haemophilus influenzae 17], Synecho- 
lary LC MS/MS and CE-MS/MS 1865 ^^^.^ jgj^ Escherichia coli (9), Rhizobium legumino- 

3.3.1 LC-MS/MS 1865 ^^^^^ j^^j^ Dictyosteiium discoideum (llj- Proteome 

3.3.2 Capillary LC-MS 1865 projects underway for tissues of more complex organ- 

3.3.3 CE-MS/MS 1865 j^^^^ include those for: human bladder squamous cell 

3.4 Assessment of 2-DE-MS proteome tech- carcinomas (12), human liver (13), human plasma 113), 
nology 1866 ^uman keratinocytes (12). human fibroblasts |12], mouse 

4 Utility of proteome analysis for biological kidney [12), and rat serum (14]. In this manuscript we cri- 
research 1868 ^-^^^jy ^^^^^^ concept of proteome analysis and the 

4.1 The proteome as a database 1868 technical feasibility of establishing complete proteome 

4.2 The proteome as a biological assay .... 1868 ^^p^^ discuss ways in which proteome analysis and 

5 Concluding remarks [870 biological research intersect. 

6 References 1870 



1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell (1). In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 
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2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences (|15-171). The description of the 
stale of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of difTerentia! dis- 
play-PCR 1 18), cDNA microarray and DNA chip techno- 
logy 119, 20) and serial analysis of gene expression 
(SAGE) 121, 22], it is now feasible to establish global and 
quantitative mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and sensitivity which is not matched by current 
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protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
lions which arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 

2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et al.^ submitted for 
publication). mRNA expression levels were calculated 
from published SAGE frequency tables 122]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
of the protein spots separated by high resolution 2-DE 
and mass spectrometric identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
duels. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig. 1). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
slate protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold {S. P. Gygi 
et ai^ submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translalionally modified by glycosylation, phosphor- 
ylation, prenylation, acylation, ubiquitinalion or one or 
more of many other modifications |23| and many pro- 
teins are only functional if specifically associated or com- 
plexed wiih other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cofactors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the slate of 
ncliviiy of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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Figure I. Correlation between mRNA and protein levels in yeast cells. 
For a selected population of 80 genes, protein levels were measured 
by "-S-radiolabeling and mRNA levels were calculated from publi- 
shed SAGE tables. Inset: expanded view of the low abundance region. 
For more experimental details, also sec Figs. 5 and 6, (S. P. Gygi et a/., 
submitted). 



are important indicators for the state of the system. The 
type of protein modification and the sites modifted at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 

2.3 Proteomes are dynamic and renect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular states 
which are characterized by significantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translational and post-translational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided that the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of protein synthesis, degradation, 
processing and modification, which are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analy.sis of the identity, quantity and stale of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system, 
and analysis of difTerentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
amplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which Ihey finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed posl-translalipnally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined- The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics of post-translational protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solvent conditions in which all proteins are soluble 
• and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SpS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful, 
general proleomics technology requires high sensitivity^ 
high throughput, the ability to difFerenliale differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-D electrophoresis - mass speclrometr)': a common 
implementation of proteome analysis 

The most common currently used impicmcnlalion of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (lEF/SDS-PAGE) gel 
elecirophoresis and their subsequent identilication and 
analysis by ma.ss spectrometry (MS) or tandem mass 
spectrofTieiry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoclcctrjc focusing (lEF) and then by SDS- 
PAGE. in the second, perpendicular dimension. Separ- 
iiled proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 
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samples. The separation of thousands of proteins has 
been achieved in a single gel 124. 25) and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denalurants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using Mterminal sequencing (26, 27], 
internal peptide sequencing [28, 291, immunoblotting or 
comigration with known proteins (30J. The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases f 3 1-33}. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most significant strengths of the 2-DE-MS approach 
include the relatively uniform ^behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simulUneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots detected in the gel are enzymalically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice ]28, 34] or digestion after electrolransfer out 
of the gel onto a suitable membrane ((29, 35-37] and 
reviewed in [38]). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins are identified by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

Tlie systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromalog- 
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Figure 2. Schematic diagram of a procedure for identtficaiion of gel- 
separaled proteins. Peptides can cither be separated by a technique 
such as LC or CE. or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 



raphy (39, 40] or capillary electrophoresis (41], the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 331- The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as ihe peptide selected for CID; this correla- 
(ion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification^ but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 



3.3 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/IMS 

li has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the iiandling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal ol peptide mixtures generated by the digestion 
of low nanogram iunounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants. We employ 
ihree different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, tC-MS/MS is 
-most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42). This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40J, analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particulariy useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 pm ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
ei ai, submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel [40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3.3.3 CE-MS/MS 

The highest level uf sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis — mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-Cli) system which was used with triple 
quadrupole and ion trap ESl-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level (43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure 3. Schematic iUustration of a 
microfabricated analytical system for C£, 
consisting of a micromachined device, 
coated capillary etectroosmotic pump, 
and microelectrospray interface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from (45), with permission. 



microfabricaled devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10—30 \im in depth and 50—70 \im in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels are 
connected to an external high voltage power supply [45]. 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmolic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/uL. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 

100 r — ■ 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of tow abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 
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Figure 4. MS spccirum of a iryplic digest 
of carbonic anhydiase using Jhe microfa- 
bricaled syslem shown in Fig. 3. 290 
fmol/ML of carbonic anhydiase iryplic 
digest was infused inio a Finnig.in I.CQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
coniaining peptides derived from car- 
bonic anhydrase arc numbefcd. Repro- 
duced from [451 wiih pefmission. 
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Figure 5. 2-DE separation of a lysatc of yeasi cells» with identified proteins highlighted. The first dimension of separation was an IPG from 
pH 3-10, and the second dimension was a 10%T SDS-PAGE gel. Proteins were visualized by silver staining. Further details of experimenial . 
procedures are included in S. P. Gygi et ol. (submitted). 
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calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid In a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46). The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene» the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified {> 95%) have codon bias values of > 0.2, 
indicating they arc highly abundant in cells. In contrast, 
codon bias values calculated for the entire yeast genotnc 
(Fig. 6a) show that the majority of proteins present in 
the proteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This fmding is of considerable iEiiportance in our assess- 
ment of the current status of proteome analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and idenlify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance^ these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proleomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells [16]. In the analysis of, for example^ the proteome 
. of any human cells, there are potentially 50000-100000 
gene products (471. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be -resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
thai has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE M8|. 
Since only those proteins which focus witfiin the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 
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Ftzure 6. Calculated codon bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. I and 5. Further details of experimental procedures are 
included in S. P. Gygi et at. (submitted). 



A Utility of proteome analysis for biological 

research 

For the success of proleomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proieome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive, llie most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
idenlincation of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. Tliis approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly chanKe. the proteome of a species. Tliis is 
illustrated in Fig. 7. The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent (49). 
It is obvious that the protein patterns are significantiy 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the ceils from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD (50] (accessible at http://www.ypd.com) and 
the human 2D- PAGE databases of the Danish Centre 
for Human Geiiome Research (12] (accessible at http:// 
biobase.dk/cgi*bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable {e.g.^ enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay, mul- 
tiple variables {e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different stales of the system under investigation. The 
stale of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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live comparative analysis of hundreds to a few thousand 
proteins. Comparative analysis of the 2-DE patterns fur- 
Ihermore highlights quantitative and qualitative differ- 
tnces in the protein profiles which correlate with the 
state of the system. For this type of analysis it is not 
essentiar that all the proteins are identified or even visu- 



alized, although the results become more informative as 
more proteins are compared. It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 





f iRU'c .'. Higl» icsolulioti 2 DI: n>ap of proteins isolated from lat serum with or without prior exposure to an inflam- 
"K^iion-ciiusing agent. (A) normal ral serum, (B) aculc-phase scrum from rats which had previously been exposed to 
>T> innammnUon cnusirip npenl. The fifst ditnension of separation is an IPG rioru pH 4—10, and Ihe second dimcn- 
'»""*»n IS a "^.^ 17 r% r gradient SDS PAGn get. Proteins were visualized by staining with amido black. Further details 
expciiitient;d proccdvires are included in (14, 49). 




Proleome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characlerize 
disease states or lo study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
pi) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A different way to use proteome analysis as a biological 
assay to define the state of a biological system is to lake 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it musl be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related lo proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
Tlie multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant slate of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent thru without prior enrichment only a relatively 
small and highly selected populalion of long-lived, 
highly expressed protein.*; is observed. There are many 
mure proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins thai execute key regulatory functions. 
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We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-l and WISP'2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISPS, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (0 C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressible promoter, and («) Wnt-1 transgenic 
mice. The WISP-l gene was localized to human chromosome 
8q24.1-8q24.3. WISP-l genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to >30-fo]d) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysleine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally const itu lively active glycogen 
synthase kinase-BjS (GSK-3/3) resulting in an increase in 
j3-catenin levels. Stabilized j3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
j3-catenin levels (9), APC is phosphorylated by GSK-3/3, binds 
to j3-catenin, and facilitates its degradation. Mutations in 
either APC or j3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-^ superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WJSP-l 
and WISP-2, and a third related gene, WISP-3, The genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C nnodule. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AFl 00777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
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cDNA was synthesized from 2 yig of poly(A)^ RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /xg 
of poiyCA)-" RNA from the parent C57MG celJs. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a XgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WJSP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISPS were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PGR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 ^M of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. ^^P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PGR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-l or a 
294-bp PGR product corresponding to nucleotides 82-375 of 
mouse WISP'2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PGR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2(^") where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
a-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The WlSP-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems, 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PGR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on j3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP'2 by approximately 5-foId by both Northern 
analysis and reverse transcription-PGR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PGR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISP^ may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of «*40,000 (Mr 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A), 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of 27,000 (Mr 27 K) (Fig. 2B), Mouse and human 
WISP-2 are 73% identical. Human WISP'2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WlSP-1 and WlSP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-l {A) and 
WiSP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A)+ RNA (2 /ig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse W75P-y-specific probe 
(amino acids 278-300) or a 190-bp W75P-2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human |3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-J (A) and mouse and human WlSP-2 (B). The potential 
signal sequence, insuHn-like growth factor-binding protein (IGF-BP), 
VWC. thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WJSP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WlSP-1. 

Identification of WISPS, To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP'S cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-], WISP-2, and WISPS are novel sequences; 
however, mouse WISP-I is the same as the recently identified 
ElmJ gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma ceils, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-I (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 35) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-l and WISP-2. Expression of 
WISP'l and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP- 1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1 j WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-l transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. {A , C, E, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of WISP-l is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stroma!(s) fibroblasts (C and £)), and tumor cells are negative. 
Focal expression of WlSP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E~H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-l is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WJSPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PGR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PGR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP'2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 




Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PGR. (B) Southern blots containing genomic DNA (10 tig) 
digested with EcoKl (WISP-1) orXbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located, at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DN A from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-I 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-l, WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-l, WISP'2, and WISPS, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-l. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WlSPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISP^ are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through j3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-l, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-l and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin ayjSa serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-l and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-j31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP- 1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP'2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-J gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
lVISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP'2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-l, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down -regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and p-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 
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ABSTRACT 

Protein kinase C (PKC) has been implicated In colon carcinogenesis in 
humans and in rodent models. However, little Is known fibouC the spcciflc 
nie of individua] PKC Isozymes in this process. We recently demonstrated 
(liat elevated expression of PKC pit in the colonic epithelium induces 
hypcrproHferailon in vivo (N» R. Murray e/a/., J. Cell Biol., 145: 699-711, 
1999). Because hyperproliferatlon is a m^or risk factor for colon cancer, 
we assessed whether speclHc alterations in PKC 011 expression occur 
during azoxymethane-induced colon carcinogenesis in mice* An increase 
in PKC pil expression was observed In preneoplastic lesions (aberrant 
crypt foci, 3,7-foid) compared with sallne^treated animals, and In colon 
tumor* (7.8-roldi P = 0.011) compared with uninvolved colonic epithe- 
lium. In contrast, PKC a and PKC fiJ (a splidng variant of PKC fill) 
expression was slightly decreased In aberrant crypt foci and dramaticaUy 
reduced In colon tumors. Quantitative reverse transcription-PCR analysis 
rtvealed that PKC mRNA levels do not directly correlate with PKC 
protein levels, indicating that PKC isozyme expression Is likely regulated 
at the posltranscriptlonal/translational level. Finally, transgenic mice ex- 
pressing elevated PKC fill in the colonic epithelium exhibit a trend 
toward Increased colon tumor formation after exposure lo aioxymethane. 
Taken together, our results demonstrate that elevated expression of PKC 
^11 is an bnportont early, promotive event that plays a role In colon cancer 
development. 

INTRODUCTION 

Colon carcinogenesis is a mullistep process involving both epige- 
netic changes in signal transduction pathways and genetic alterations 
In oncogenes and tumor suppressor genes, resulting in progressive 
disregulation of cell proliferation and survival mechanisms (1, 2), 
PKC^ is a family of 12 distinct serine/threonine kinases that partici- 
pate in signaling pathways involved in cellular proliferation, differ- 
entiation, and apoptosis in diverse cell systems (3). Differences in 
tissue expression, subcellular localizadon, and activator/substrate 
specificity predict that individual PKC isozymes have distinct cellular 
functions (4-7). Alterations in PKC isozyme levels have been ob- 
served in human and rodent colon cancers, suggesting a role for PKC 
in colon carcinogenesis (2, 8-10). liowever, little is known about the 
specific role of individual PKC isozymes in the carcinogenic process 
in vivo. 

Several studies have investigated expression of the PKC fi isozyme 
in rat and human colon tumors, with varied results (lO^M). PKC fi 
protein expression was found to increase (8, I i), remain the same 
(10), or decrease (!2) in colon tumors as compared with normal 
epithelium. When the level of PKC /3 mRNA was analyzed, it was 
found to be decreased in colon tumors when compared with normal 
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epithelium (15, 16). However, the PKC fi gene actually codes for two 
distinct proteins generated by alternative splicing, PKC /3I and PKC 
fill, which differ in the last 50 amino acids (17). Because the expres- 
sion of these two PKC jS isozymes may be differentially regulated 
(18). it is important to analyze the expression of each individual PKC 
P isozyme to assess its potential role in colon cancer. 

We recently demonstrated that overexpression of PKC fill in the 
colonic epithelium results in hyperproliferation and increased suscep* 
tibility to carcinogen-induced preneoplastic lesions (19)» demonstrat- 
ing a direct role for the PKC fill isozyme in proliferation in the 
colonic epithelium. Based on these data, we hypothesized that an 
increase in expression of PKC J3II Is an early event in colon carcino- 
genesis that occurs in preneoplastic tissue to provide a proliferative 
advantage to precancerous cells. To test this hypothesis, we analyzed 
the expression of PKC jSlI, its splice variant, PKC ^I, and PKC a in 
normal colonic epithelium, in AOM-induced early preneoplastic ACF, 
and in colon tumors. 

Our results demonstrate that expression of PKC fill increases 
dramatically early in the carcinogenic process. In contrast, expression 
of the related PKC a and PKC ^I isozymes decreases later in tumor 
development. Furthermore, transgenic mice overex pressing PKC j3II 
in the colon exhibit increased susceptibility to carcinogen-induced 
colon tumors. Taken togedier, our results demonstrate that a specific 
increase in PKC fill expression is an early event in colon carcinogen- 
esis that has a direct, promotive role in the development of ACF and 
colon tumors. Our data also suggest that loss of PKC a and PKC fil 
expression may be involved in later stages of tumorigenesis. The 
present study is the first to directly investigate PKC isozyme expres- 
sion in both ACF and colon tumors in the mouse. Our results provide 
a better understanding of the potential role of specific PKC isozymes 
in the muitistep process of colon carcinogenesis. 



MATERULS AND METHODS 

Carcinogen Treatment. Female 6-7-week-old C57BL/61 mice (The lack- 
son Laboratory) were injected i.p. with AOM at a dose of 10 mg/kg body 
weight or with an equal volume of saline once a week for 2 weeks, as described 
previously (20). AU animals were housed five mice/cage in microisolator cages 
in a pathogen-frcc barrier facility maintained at a constant temperature and 
humidity on a i2-h light/l2-h dark cycle. Mice were provided with a standard 
amoclavable chow (Purina 7012; 5% fat) and autoclaved water a J libitum until 
the termination of the experiment. Animals were sacrificed at either 20 or 36 
weeks after the second AOM injection for analysis. 

Isolation of ACF. At 20 weeks after the second AOM injection, animals 
(AOM- or saline-treated animals) were sacrificed by cervical dislocation. The 
colons were removed, flushed with ice-cold PBS, and cut longitudinally. For 
protein analysis, colons were fixed flat between sheets of fitter paper in 70% 
ethanol for 30 min at 4*^C (21). After staining with 1% methylene blue in PBS 
for 5 min, the fixed colons were assessed for the presence of ACF at low 
magnification (X40) on a dissecting microscope using criteria described pre- 
viously (22, 23). ACF from eight to nine AOM-treated mice (range, 42-- 88 
ACF/group; average, 2 ± 1 crypts/ACP; range, 1-7 ciypts/ACF) were isolated 
using fine^tipped forceps as described previously (21), pooled in ice-cold 
radioimrounoprecipitation assay buff^er (SO mM Tris (pH 7.2), 150 mM NaCl, 2 
mM EDTA, 0.4 mM BGTA, 20 sodium fluoride, 0.5% deoxycholate. 1% 
NP40, 0.1% SDS, 0.1 mM sodium orthovanadate, 2S Mg^ml aprotinin, 20 
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/ig/ml Icupeptm, 2.5 /tg/ml pepsuiln, I pig/ml soybean trypsin inhibitor, and 
34.5 Mg/ml 4-(2-aniinocthyl)bcnzcne sulfonyl fluoride], and boiled immedi- 
ately In SDS Laemmli sample buffer The protein concentration of these 
extracts was determined using a Coomassie Filler Protein Assay (24). Normal 
colonic crypts were> isolated from the colon of saline-treated animals in a 
pattern representative of the disttibuiion of ACF in colons from AOM-ireatcd 
animals and processed as described above. 

For RNA analysis, ACF and normal crypts were isolated as described 
above, except that the colon was flushed with diethyl pyrocarbonate-ireaied 
PBS, and crypt isolation was performed on unfixed tissue. All steps were 
carried out under RNase-free conditions. The isolated ACF and normal crypts 
were put into Triiol reagent (Life Technologies, Inc.) and immediately frozen 
at -BO'C until analysis. 

Isolation of AOM-lnduccd Mouse Colon Tumors and Normal Colonic 
Epithelial Tissue. At 36 weeks after the second AOM injection, animals were 
sacrificed by CO, asphyxiation; the colons were isolated, cut longitudinally, 
and rinsed well with ice-cold PBS; and tumors were identified visually. 
Tumors were removed from the colon, divided, and processed for histopalho- 
logical analysis and protein and RNA isolation. For protein isolation, normal 
colonic epithelium and tumor tissues were processed as described above. The 
protein concentration of each sample was determined by the BCA assay 
(Pierce), and samples were subjected to immunoblot analysis as described 
previously (19). 

For RNA isolation, tumors and scraped normal colonic epithelial tissues 
were snap frozen in liquid nitrogen and stored at -80^ until extraction. Total 
celhilar RNA was isolated from frozen colonic mucosa and tumor dssue using 
TVizd reagent (Life Technologies, Inc.) according to the manufocturer*s in- 
stntctlons. The integrity of 18S and 28S iRNA was determined by ethidium 
bromide staining after electiophoictlc separation in agarose gels. 

For histopaihological examination, umiors were fixed in 4* paraformalde- 
hyde for 4 h at 4''C, washed several times with ethanol. paraffin embedded as 
described previously (25), and stained with H&E. Complete histopadiological 
analyses were carried on all tumor samples by a board-certified pathologist 
CZ.G.). 

Immunoblot Detection of PKC Isozymes. Protein extracts from normal 
colonic crypts, ACF» and colon mmors were subjected to SDS -PAGE on 10% 
gels. After transfer Co nitrocellulose (Schleicher & SchucU), samples were 
subjected to immunoblot analysis using isozyme-spedfic antibodies to PKC a 
(26), PKC pi PKC fill, and actin (Santa Crui Biotechnology). Immune 
complexes were detected by binding of affinity-purified peroxidase-labelcd 
secondary antibodies (Kirkegaard & Perry Laboratories) and reaction with the 
SuperSignal Pico (PKC a, PKC pi, and /5-actin) or SuperSignal Femlo (PKC 
/all; Pierce) chemilumlnescence system. Rat brain lysatc was used as a positive 
control for each antibody. Band intensity was quantified by densitometric 
scanning (Molecular Dynamics). The level of expression of individual PKQ 
isozymes was normalized to the 0-acttn levels in each sample. 

Imnmnohistochemlcal Analysis of PKC pU Expression In Mouse Colon 
Tumors and Normal Colonic Epithelium. Immonohistochemical analysis 
was performed on S«fun sections of nomial colonic epithelium and colon 
tumors that were paraformaldehyde fixed and paraffin embedded. After depar- 
affinizatbn and rehydration, sections were processed for antigen letrieval as 
described by the manufacturer (DAKO) and treated with 3% hydrogen perox- 



ide in methanol to inhibit endogenous peroxidases. PKC pil expression was 
detected using the ABC staining system and an isoiype-specific antibody to 
PKC p II (Santa Cruz Biotechnology). 

QRT-PCR Analysis of PKC Isozyme mRNA Expression. Total cellular 
RNA from ACF, normal colonic epithelium, and colon tumors was isolated 
using Trizol reagent (Life Technologies. Inc.) according to the manufacturer*? 
instruclions. Reverse transcription was performed using 2 ;ig of total RNA, I 
/Jig of oligo(dT) primer (Promcga), 10 mM DTT, 0.5 mM deoxynuclcoude 
triphosphates, and 200 units of Superscript II reverse transcriptase (Life 
Technologies, Inc.) in a 50-;tl reaction as described previously (II). Samples 
without reverse transcriptase served as negative controls. Amplification of the 
cDNAs was carried out using EasyStan PCR raix-in-a-iube (Molecular Bio- 
products), 2.5 unlis of Taq polymerase (Pcomega), and the appropriate primer 
pairs for each PKC Isozyme. Primers used for PKC isozymc-specific PCR and 
the expected sizes of ihcir products arc as follows: (a) PKC tt. 5'-TOAAT C- 
CTCAGTGGAATGAGT-3' (forward primer) and 5'-G0TTGCnTCT- 
GTCTTCTOAA-3' (reverse primer), 325 bp; (fc) PKC fil S'-TOTGATO- 
GAGTATCTGAAC(jG(}GG.3' (forward primer) and 5'-TCGAAGTT- 
GGAGCTGTCrrCGCTTG-3' (reverse primer). 640 bp; (c) PKC pll, 5'- 
CATCTGGOATGGCGTaACAACC-3' (forward primer) and 5'-CG0TC- 
GAAGTTTTCAGCGTTTC-3' (reverse primer), 420 bp; and (d) P-aciln, 5^- 
GTGGGCCGCrCTAGGCACCAA-3' (forward primer) and 5'-CTCT- 
TGATCTCACGCA(X}ATTTC-3' (reverse primer), 540 bp. The optimized 
linear range for each PKC isozyme was determined as 30 cycles for PKC a, 35 
cycles for PKC pi and pH, and 25 cycles for ^-actin at 95'»C for 45 s, 60'C 
for 45 s. and 72'C for 2 min, followed by a lO-min incubation at 77*0, An 
initial denaturation step was performed at 95^*0 for 2 min. PCR products were 
separated in 1.8-% agarose gel (FMC Bioproducts). and the intensity of 
^hidium bromide fiuorescence was quantitated using an Eagle Eye (Strat- 
agene) densitometer. (Quantitation of PKC mRNA expression was normalized 
to |3-acdn mRNA levels, which have been shown to be unchanged during colon 
carcinogenesis (27). 

Quantitation of Colon Tumor Parameters In AOM-treated Transgenic 
PKC pil and Nontransgenic Mice. PKC ^11 transgenic mice and noncrans* 
genie lilicrmales (19) were injected with AOM as described above. At 36 
weeks afier the second AOM ii^ecUon. the mice were sacrificed, and the 
colons were removed, col open longitudinally, and rinsed well with cold PBS. 
Colon tumors were identified visually, and the location and size (mm^) of each 
tumor were recorded. 

Statistical Analysis. Values are expressed as fold of control expres- 
sion ± SE. Statistical significance was calculated by using a one-way 
ANOVA; P £ 0.05 was considered to be statistically significant. 



RESULTS 

Elevated PKC fill Expression In ACF. We recently demon- 
strated that increased expression of PKC ^11 leads to hyperprolifera- 
tion of the colonic epithelium (19). Because hypetproliferation is an 
early promotive event in colon carcinogenesis, we assessed the level 
of expression of PKC fill in early preneoplastic lesions in AOM- 
treated mice. ACF are the earliest identifiable preneoplastic lesions in 



Fig. I. ACF dissection from an AOM<u%ated 
mouse colon. C57BU6I mice icceived injection of 
AOM or saline once b week for 2 weeks ( 19). At 20 
weeks after the second AOM injeetion. mice were 
sacriflced. and the colons wcte Isolated. The colons 
were cui longitudinally, fixed Rat in 70% edionol. 
end stained wldi 1% methylene blue for 5 min. 
ACF were Identified under low magnification 
(X40) using previously defined criteria for AGP 
(22, 13). A icpiesentatlve colonic epithelium con- 
taining an ACF fiom an AOM-tieated mouse Is 
chown before (A) and after (0) microdissection of 
the AGP. 
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Che prog^ression of normal colonic epithelium to colon carcinoma in 
both carcinogen-Induced rodent models and humans (22, 23. 28). 
AOM r&producibly induces ACP and colon tumors in rodents, which 
eKhibit xniny of the same genetic and signal transduction defects 
identified in human colon carcinomas (29-31). C57BL/6J mice were 
injected with AOM or saline as described. At the preneoplastic stage 
of colors carcinogenesis (20 weeks after injection), mice were sacri- 
ficed, and their colons were removed and analyzed visually for the 
presence of ACF. Individual ACP were microdissected from stained 
colons for biochemical analysis as described in "Materials and Meth- 
ods." A representative colonic epithelium containing an ACF from an 
AOM-tr«ated mouse is shown before (Fig. \A) and after (Fig. 1^) 
microdissection of the ACP. No ACF were observed in salinc-u-eatcd 
control animals, consistent with our previous report (19). ACP from 
AOM'treated mice and normal colonic epithelial tissue from saline- 
treated mice were isolated as described previously (21), pooled, and 
subjected to immunoblot and RNA analysis as deschbed in "Materials 
and Methods." A representative immunoblot for PKC a, /3I, and j3II 
in ACF and normal ciypts is shown in Fig. 2A. A single band with an 



20 weeks AOM treatment 
N ACF 




PKCa 



Ftg. 3. Colon lumor from an A(^-ireued mouse. A, gross pathology of a tumorw 
conminlDg colon from an AOM-tieatcd mouie. histopadioiogy of typical severtly 
dyspSosiie, tubular adenoma from the eolon of aa AOM-treaied notne. Samj^ was 
stained with H&E (XlO). 
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Fig. 2. Iromunoblot analysts for PKC isozyme expression in ACP and normal colonic 
uypu. Carcinogen treatment and colon isolation were perfomied as described in the Fig. 
1 legend. ACF from AOM>(reated mice and nonnal cryptt from SBline-ireaied mice were 
isolated from fixed, stained colons as described in "Materials and Methods.** Total cellular 
protein <5 ;ig) was subjected to immunoblot analysis using isozyme-specific PKC antl« 
bodies. A, a representative immunoblot analysis for the indicated PKC Isofonn is shown. 
M normal cryprs from saline-tieaied animals; ACF, ACF from AOM^lreated animals, fl. 
immunoblois weie quantitated by densliometric scanning of the film. Results are pre- 
sented as the average of two independent experiments. 



apparent molecular mass of 85 kDa for PKC er, ^], and /31I was 
detected, consistent with the predicted size for each intact protein. The 
changes in PKC isozyme levels from two independent experiments 
were quantitated by densitometric analysis, and the average relative 
levels of expression were presented as fold of control (Fig. 2B). PKC 
^11 protein was increased 3.7-foid in ACF relative to normal epithe- 
lium (range, 2.0-S3*fold). In contrast, PKC a and pi were modestly 
decreased in ACP. These finding demonstrate that multiple changes 
in PKC isotype expression occur in the early, preneoplastic stage of 
colon tumor formation. 

PKC Isozyme Expression Is Altered in Colon Tumors. We next 
determined the level of PKC a, fih and pil protein in mouse colon 
tumors at 36 weeks after AOM treatment. Tumors were isolated and 
stained with H&E as described in ^'Materials and Methods." His- 
topathological analysis determined that all tumors were tubular ade- 
nomas, a majority of which were severely dysplastic/carcinoma in 
situ. No invasion of the lamina propria was observed in any lesions. 
The gross pathology and hlstopathoiogy of a representative colon 
tumor are shown in Fig. 3, A and respectively. Immunoblot analysis 
of representative mouse colon tumors demonsuated a dramatic in- 
crease in PKC fill expression, whereas PKC ^I and a were suikingly 
rediKed or lost in colon tumors compared with normal epithelium 
(Fig. 4A). Quantitative analysis of samples from multiple tumors 
demonstrated a statistically significant increase in PKC |3II expression 
in all AOM-induced mouse colon tumors analyzed (7.7 ± 2.1; 
P B 0.0U), whereas PKC or and ^I were both significantly decreased 
in tumors (0.16 ± 0.05 (/» = 4 X 10"') and 0.08 ± 0.05 
(/> » 2 X 10''). respectively] compared with uninvoived colonic 
mucosa. Similar alterations in PKC isozyme expression were ob- 
served in human colon tumors, confirming the similarity of the mouse 
carcinogen model to human colon tumors.^ Taken together, our results 
indicate that an increase in PKC 011 expression occurs early in colon 
carcinogenesis and is maintained in the later stages of this process. 



* Y. OSkmen-Folar and A. P. Fields, unpublished results. 
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Fig. 4. Expression of PKC isozymes in AOM-induccd mouse coloi» lumors end 
uninvolved colonic epithelium. C57BL/6J mice were injected with AOM lo induce colon 
tumors as described in "Materials and Metbods." Thiciy-slx weeks after AOM ireatroent. 
the mice were sscrjficcd. and the colons were excised. Uninvolved colonic epiihethim and 
colon (umoTS were isolated from (he s&nie animal as described in "Materials and Meth- 
ods." A, total cellular protein (10 fig) was subjected to iramunobloi analysis using 
isozyroe-specinc PKC anltbodies. Each sample was normalized according to the j3-actin 
levels. Representative rcsulu of immunobloi analysis with a specine antibody to the 
indicated PKC isozyme are shown. //, uninvolved mouse oolonic epithelium; T. mouse 
colonic lumon. B, immunrtrfots from five pain of tumor and uninvolved tissue were 
quanltioted. and the mean £ SB value is presented for each PKC isozyme. /» < 0.05. 



Our data also demonstrate thai the elevation of PKC j3II is isozyme 
specific because a progressive decrease in PKC a and pi is observed 
dunng tumor development. 

Immunohistodiemical Detection of PKC fill Expression. To 
examine the distribution of PKC in colon lumors, immunohisto- 
chemical detection of PKC PU was performed on normal epithelium 
from the distal colon and colon tumors isolated from AOM-injected 
mice (Fig. 5). Low level PKC /311 staining was detected in both the 
base of the crypts and at the luminal surface of the normal epithelium 
(Fig. 5/1). The PKC j3ll staining at ihe base of the colonic crypts is 
concordant with the location of the stem cell population in the distal 
colon, consistent with a role for PKC pU in proliferation. However, 
the presence of immunoreactivc PKC pll in postmitotic colonocytcs 
suggests that PKC fill may serve multiple roles in this tissue. In colon 
tumors (Fig. 5B). a high level of PKC pil staining was detected 
throughout the tumor, with the highest levels seen at the surface of the 
tumor. Interestingly, little or no staining was detected in the stromal 
elements of the tumor, indicating that the observed increase in PKC 
pil in colon tumors is due to an increased expression in the epithelial 
cells comprising the tumor rather than tumor-associated stromal or 
immune cells. These results confirm the immunoblot data showing 
overexpression of PKC $11 in colon tumors and demonstrate the 



colocallzaiion of PKC pil with the proliferative cell fraction in both 
normal colonic epithelium and colon lumors. 

PKC pil mRNA Is Only Slightly Increased in AOM-induced 
ACF. To investigate the regulatory mechanisms responsible for PKC 
isozyme expression during the early and late stages of colon tumor- 
igenesis, we analyzed the relative mRNA levels for PKC a. /3I, and 
PU in AOM-induced ACF and normal crypts. Total RNA was isolated 
from ACF and normal crypts and subjected to QRT-PCR using 
primers specific for mouse PKC a. pi, and pll isozymes, as described 
in **Materials and Methods." Representative QRT-PCR analyses are 
shown in Fig. 6A. For each RNA sample, no amplified products were 
detected in the absence of reverse transcriptase (data not shown). 
Relative band intensities were quantitated by densitomelric scanning 
and are presented as the fold change in expression in Fig. 65. The 
level of PKC pil mRNA in ACF was only slightly increased relative 
to that m normal colonic epithelium, as compared with the 3.7-fold 
increase in PKC pil protein detected in ACF. In contrast, PKC fil and 
a mRNA levels in ACF were decreased to a level commensurate with 
the decrease in protein expression of these isozymes in ACF. 

PKC fill mRNA Levels Do Not Reflect the Dramatic Alter- 
fttions in PKC pU Protein Levels In Colon Tumors. Total RNA 
was isolated from both colon tumors and normal colonic epithelium at 
36 weeks after AOM injection and subjected to QRT-PCR using 
primers specific for mouse PKC a, pi. and pil isozymes as described 
in "Materials and Methods" (Fig. 7). Relative mRNA levels were 
quantitated by densitometric scanning, and the fold increase in ex- 
pression was calculated for each PKC isozyme (Fig. 7B). PKC pU 
mRNA levels were slightly elevated in tumors, whereas those for PKC 
a were slightly reduced compared with nontnal epithelium. The level 
of PKC ^I mRNA was also increased slightly in tumors. Therefore, 
the small alterations in mRNA expression for these PKC isozymes do 
not correlate with the dramatic changes in expression of the corre- 
sponding protein. 




Ftg. 5. Immunohistochemtcal analysis of PKC in mouse colon (umor and nonnal 
colonic ephhelium. Immunohistoehemical detecUon of PKC expression was per- 
formed at described in "Matedals and Methods.*' PKC /3I1 axpression was detected using 
(he ABC staining system and an isozyme-specinc antibody to PKC ^11. Antigen was 
visualized by 3.3'-diaminobentidine staining. A. nonnal mouse colonic cptchellum. 
bar a 100 fiM; colon tumor from AOM-iftAied mouse, bar^ SO^m. 
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also elevated from 12.5 ± 5.5 mm'/mouse in nonlransgenic mice lo 
19.3 i 5.4 mm^/mouse in transgenic PKC mice, finally, the 
average tumor size was greater in transgenic PKC pXL mice 
(12,4 ± 1.6 mmViumor) than In nonlransgenic mice (lO.O ± 2.6 
mm'Aumor). Because of the small number of mice used in this smdy» 
ihe increases Observed in transgenic PKC fill did not reach statistical 
significance. However, the increases in all of these colon lumor 
parameters suggest that elevated expression of PKC ^11 in the colonic 
epithelium results in increased susceptibility to colon carcinogenesis 
and tumor formation. 

DISCUSSION 

Colon carcinogenesis is a multistep process that results from mul- 
tiple genetic and nongenetic alterations (1). Identification of the 
alterations that occur in the earliest detectable preneoplastic lesions 
may allow development of early colon cancer screening and preven- 
tion methods. Elevated expression of PKC ^11 has been associated 
with neoplastic transformation in both rat and human colonic epithe* 
lium (8. 1 1). In the present study, we assessed tlie timing and impor- 
tance of changes in PKC fill expression during colon carcinogenesis. 
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fig. 6. QRT*PCR onalysU for PKC a, pi, and ^11 mRNA in ACF and normal colonic 
epiihelium. Total RNA was isolated from pooled ACF from AOM>trea(ed mice or from 
normal colonic epithelium from saiine-treated mice. reverse-irBnscfibed, and amplified by 
PCR using PKC iwzyme-spedfie primen as described in '^Materials and Methods." The 
PCR pfoducts wcie eleciiophoresed on a I.S% agarose gel and stained with ethldiuro 
bromide. A, represeniailve reverse trenscripiion-PCR resulu are shown for each PKC 
isozyme. Each sample was normalized lo the P-adin levels. N, normal crypts from 
saline^treated mice; ACF, ACF from AOM-treaied mice. 0, quantitation of relative 
e;tpression of PKC isozyme mRNA by densiiomeiric scanning of the stained gel. Resultt 
are presented as Ihe average of iwo independent experiments. 



Transgenic PKC fill Mice Are More Susceptible to Develop* 
ment of Cardnogen-induced Colon Tumors. We have demon- 
surated chat transgenic mice that overexpress PKC j3II in the colonic 
epithelium are more susceptible to AOM-induced colon carcinogen- 
esis, as assessed by an increase in the total number of ACF and in the 
number of ACF of higher multiplicity than nontransgenic mice (19). 
The number and multiplicity of ACF have been demonstrated to be 
highly predictive of subsequent tumor formation in carcinogen- 
induced rodent models (32, 33). To further evaluate the effect of 
transgene expression on^ susceptibility to AOM-induced colon tumor 
formation, transgenic PKC pU mice and nontransgenic littermates 
were treated with AOM and analyzed for tumor formation as de- 
scribed in "Materials and Methods.*' Transgenic PKC /3II mice exhibit 
a higher incidence (9 of 15 mice; 60%) of colon tumors than non- 
transgenic mice (4 of 12 mice; 33%) in response to AOM treatment. 
In addition to tumor incidence* other tumor parameters also showed an 
upward trend in transgenic PKC fill mice. Thus, tumor multiplicity 
(average number of tumorsttumor-bearing mouse) increased from 
1.3 ± 0.3 (5 tumors in 4 nontransgenic tumor-l>earing mice) to 
1.5 ± 0.3 (14 tumors in 9 tfansgcnic PKC fill tumor-bearing mice). 
Tumor burden (average tumor area per tumor-bearing mouse) was 
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Pig. 7. Relative mRNA levels of PKC isozymes in AOM>induced mouse colon tumors 
and normal cotonic epiihelium. Total RNA (2 ;ig) was extracted from uninvolved colonic 
epiUtelium and tumor tissues from AOM-treaied mice, reverse*lranscribed, and amplified 
by PCR as described in ^'Materials and Methods.** A. repiesemative reverse tianMriptlon- 
PCR results for each PKC isozyme are shown. N, normal, uninvolved mouse epithelium; 
r. tumor tissue. 8, relative expression of PKC isozyme mRNA. was quontitated from five 
pairs of tumor and uninvolved tissue, and the mean t S£ value is presetvted for each PKC 
isozyme. ^,F< 0,05. 
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IJsIng the mouse carcinogen (AOM) colon carcinogenesis models we 
determined the levels of PKC a, and 011 expression in colon 
tumors and in ACP, the earliest preneoplastic lesions found in both 
AOM-treated niice and humans. 

Our findings suggest that PKC /3II plays a role at multiple stages of 
colon carcinogenesis and thai changes in more than one PKC isozyme 
may be involved in the muUistep process of colon lumorigenesis. In 
ACF, the level of PKC fill protein expression was strikingly increased 
compared wilh a small reduction in PKC a and ]3I protein expression, 
innplicating PKC /3II in early preneoplastic changes. In colon tumors, 
wc observed a significant elevation in PKC fill protein expression 
compared with that seen in normal colonic epithelium, whereas the 
expression of PKC <x and fil was significantly reduced or lost. There- 
fore, increased PKC (311 expression occurs in the early, preneoplastic 
stages of colon carcinogenesis and remains high in colon tumors. On 
the other hand, the dramatic decrease in PKC a and /3I protein 
expression in tumors but not ACF suggests that decreases in PKC a 
and /3I expression may be importanl for later stages of tumor devel- 
opment. Similar changes in PKC isozyme expression have been 
observed in human colon tumors,* emphasizing the relevance of the 
AOM-induced colon carcinogenesis model to sporadic human colon 
cancer. These results are In accordance with previous reports from our 
laboratory, and others have demonstrated that PKC pil promotes 
cellular proliferation in human leukemia cells and colon cancer cell 
lines (7, 34). A direct role for PKC pll in colon carcinogenesis has 
also been established by our recent finding that transgenic PKC jBII 
mice exhibit colonic hyperproliferation and Increased susceptibility to 
colon carcinogenesis, as determined by an increase in the total number 
of ACF and in the number of ACF of higher multiplicity as compared 
wilh nontransgcnic mice (19). In the present study, we extend these 
findings and report a trend toward higher tumor incidence in trans- 
genic PKC fill mice as compared with nontransgcnic mice. Average 
tumor burden, tumor size, and tumor multiplicity all showed an 
upward trend in transgenic PKC fill mice compared with nontrans- 
gcnic mice. Our recent data also demonstrate that elevated PKC jSII 
leads to inhibition of glycogen synthase kinase 30 (GSK-30) activity 
and an accumulation of l3-catenin (19), suggesting that PKC fill 
causes colonic hyperproliferation and promotes colon carcinogenesis, 
at least in part, by activating the adenomatous polyposis coli (APC)/ 
/3-catentn signaling pathway (19). 

Based on epidemiological studies in man (reviewed In Refs. 35 and^ 
36) and biochemical studies in rodent models (37, 38), it has been well 
established that diets high in certdn fatty acids play a promotive role 
in colon carcinogenesis. Furthermore, high-fat diets result in the 
production of fatty acid metabolites and secondary bile acids that 
stimulate lumenal bacterial phospholipases and activate colonic epi- 
thelial cell PKC by production of diacylglycerol (39, 40). Based on 
these data, we have proposed a model for PKC ^11 in colon carcino- 
genesis in which stimulation of PKC fill by various dietary lipid 
components results in the activation of the APC/0-catenin signaling 
pathway, resulting in hyperproliferation and an increase in suscepti- 
bility CO colon carcinogenesis (19), Based on our present findings, we 
propose that an increase in PKC |3II expression early in colon carci- 
nogenesis results in hyperproliferation and ACF formation. Our pre- 
vious studies demonstrated that traiisgenic PKC fill mice exhibit 
increased numbers of ACF and also increased multiplicity of ACF. 
suggesting a role for PKC fill in both initiation and progression (19). 
Future studies will address the issue of whether the expression level of 
PKC j3n in individual ACF correlates wilh genetic and morphological 
changes associated with progression to carcinoma. 

Interestingly, the expression of PKC fil protein, a splice variant of 
PKC fill, is dramatically reduced or lost in AOM-induced mouse 
colon tumors. These data suggest that PKC fil and PKC fiU play 



distinct and possibly opposing roles in colon lumorigenesis and that 
the differences in expression may be regulated at the level of alter- 
native splicing. A switch mechanism from PKC fii to fill mRNA has 
been reported in response to insulin in which alternative splicing 
results in increased protein levels of PKC ^11(18). In L6 skeletal 
muscle cells, insulin enhances the inclusion of a PKC ^BU-specific 
exon and (hereby specifically increases the expression of PKC fill 
protein (41). Experiments are underway to determine whether a sim- 
ilar switch mechanism is involved in the dramatic increase of PKC fill 
and loss of PKC fil protein expression during colon carcinogenesis. 

Expression of PKC a is dramatically reduced or lost in AOM- 
induced colon tumors, suggesting a negative relationship between 
PKC a and cell growth. Consistent with these findings, a compelling 
body of evidence indicates that PKC a is associated with negative 
growth regulation and cell cycle arrest in various cell lines (42-46), 
Our earlier data also demonstrated that overexpression of PKC o is 
involved in cytostasis and phorbol i2-myrtstate l3-acetale (PMA)- 
induced differentiation in human K562 erythroleukemla cells (7). 
whereas inhibition of PKC a expression blocks phorbol 12-myristate 
13-ac6tate-tnduced differentiation (3). In the nontransformed intestinal 
epithelial IEC-18 cell line, activation of PKC a results in agonist-induced 
cell cycle arrest and induction of Cip/Kip family of cyclin-dependent 
kinase inhibitors, concomiunt with the hypophosphorylation of the reti- 
noblastoma protein (Rb) (47). Furthermore, overexpression of PKC a 
causes inhibition of cellular proliferation and enhanced differentiation in 
CaCo-2 human adenocarcinoma cells, whereas antisense PKC a-irans- 
fecied CaCo-2 cells exhibit enhanced cellular proliferation and trans- 
formed phenotypc (45, 46). Nude mice injected with antisense PKC 
a-expressing CaCo-2 cells exhibited an increased number of tumors and 
an increased mmor burden compared widi mice injected with sense PKC 
a-expressing CaCo-2 cells (46). Additionally, a role for PKC « in cell 
adhesion has been implicated in the poorly differentiated colorectal 
carcinoma VOM cell line (48). PMA-induced activation of PKC ct 
restores normal epithelial morphology in VOM cells (48). These data, as 
well as those presented here, are consistent with the hypothesis that the 
loss of expression of PKC a during colon carcinogenesis allows prema> 
lignant cells to oveicome normal growth-inhibitory processes. It temains 
to be detemiined whether PKC cx plays a negadve growth-regulatory role 
in the colonic epithelium in vivo. 

To investigate the regulatory mechanisms responsible for the alter- 
ations of PKC isozyme expression, we analyzed mRNA expression of 
PKC a, fih and ^I1 in AOM-induced ACF and colon tumors. In both 
ACF and tumors, the level of PKC fill mRNA was increased, whereas 
the level of PKC a mRNA was decreased. However* although the 
changes in mRNA levels were significant, the dramatic changes in 
PKC isozyme protein expression could not be explained by transcrip- 
tional control alone. This is particularly true for PKC fil, where 
protein expression is almost undetectable in tumors, whereas the level 
of mRNA in tumors is slightly increased relative to normal epithe- 
lium. Other regulatory mechanisms that could contribute to the ob< 
served alterations in PKC isozyme expression include changes in 
message stability or translatabtlity. protein phosphorylation, and pro- 
tein stability. Future studies will explore the contribution of these 
mechanisms to PKC isozyme expression in colonic epithelial cells. 

In summary, the expression of PKC isozymes changes dramatically 
during colon carcinogenesis. Expression of PKC fill increases early in 
preneoplastic lesions (ACF) and is maintained in colon tumors. PKC 
a expression declines progressively throughout the carcinogenic proc- 
ess. A dramatic change in expression of PKC j3I also occurs later in 
the carcinogenic process, with essentially no detectable PKC 
remaining in colon tumors. Our data demonstrate that changes in 
mRNA levels for individual PKC isozymes do not coincide with 
alterations in protein expression. Therefore, caution shoukl be used in 



1380 



PKC m IN EARUY STAGES OF COLON CAWnNOOENESIS 



interpreiing quanlUaiive analysis of changes in PKC isozyme mRN A 
in the absence of informalion on protein expression. Future studies 
win focus on the delineation of the molecular mechanisms responsible 
for the alterations of individual PKC isozyme expression. Fmally, 
development of transgenic mouse models for individual PKC 
isozymes will provide valuable tools to analyze the role of individual 
PKC isozymes in colonic epithelial cell homeostasis and colon car- 
cinogenesis. 
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The t(5;14) Chromosomal Translocation in a Case of Acute Lymphocytic 
:i:.&ukemia Joins the Interleukin-3 Gene to the Immunoglobulin Heavy Chain Gene 

By J. Chnstopher Grimaldi and Timothy C. Meeker 



O^romosomal translocations have proven to be important 
rfvarkers of the genetio abnormalities central to the patho- 
genesis of cancer. By cloning chromosomal breakpoints 
orie can identify activated proto^ncogenes. We have etud- 
i&d a case of B-lineage acute lymphocytic leukemia (ALL) 
thast was associated with peripheral blood eoslnophiiia. The 
cfiromoeomal translocation t(5:14) {q31:q32) from this 
sample was cloned and studied at the molecular level. This 

KARYOTYPIC STUDIES of leukemia and lymphoma 
have identified frequent nonrandom chromosomal 
translocat'ons. Some of these translocations juxtapose the 
innimmoglobuUn heavy chain (IgH) gene with important 
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Fig 1. DNA blots of the leukemia sample. The restriction 
fragment pattern of normal human DNA (N) and the leultemia 
sample (L) were compared usmg a human Jh probe. Rearranged 
bands ere indicated by arrows. Sample L exhiblte e single rear- 
ranged band with both HmdIII/fcoRl and SaU3A restriction 
digests. The rearranged bands are less intense than the other 
bands because the majority of cells In the sample represent normal 
bone marrow elements. 



tranislocation Joined the immunoglobulin heavy chain Join- 
ing ( Jh) region to the promoter region of the interieukln-3 
(iL-3) gene in opposite transcriptional orientations. The 
data suggest that actuation of the IL-3 gene by the 
enhancer of the Immunoglobulin heavy chain gene may play 
a central role in the pathogenesis of tMs leukemia and the 
associated eosinophilia. 
a 1989 by Grune & Stratton^ Inc. 

protooncogenes, such as omyc and bcl-1}^ In this way, the 
IgH gene can activate protooncogenes, resulting in disor- 
dered gene expression and a step in the development of 
cancer. The investigation of additional nonrandom transloca- 
tions into the IgH locus allows us to identify new genes 
promoting the generation of leukemia and lymphoma. 

A distinct subtype of acute lymphocytic leukemia (ALL) 
has been characterized by B4ineage phenotype, associated 
eosinophilia in the peripheral blood, and a t(5;14){q31;q32) 
chromosomal translocation.'*^ This syndrome probably 
occurs in <1% of all patients with ALL. We hypothesized 
that the cloning of the translocation characteristic of this 
leukemia might allow the identification of an important gene 
on chromosome 5 that plays a role in the evolution of this 
disease. In this report we demonstrate that the interleukjn-3 
gene (IL-3) and the IgH gene are joined by this transloca- 
tion. 

MATERIALS AND METHODS 

Sample and DNA blots. A bone marrow aspirate from a repre- 
sentative patient with ALL (LI morphology by Frencb-American- 
Britlsb [FAB] criteria), peripheral eosinophilia (up to 20,000 per 
microliter with a normal value of <350 per microliter) and a 
t(5;l4)(q31;q32) translocation was studied. Using published meth- 
ods, genomic DNA was isolated and DNA blots were made.' Briefly, 
10 ;tg of high molecular weight (mol wl) DNA were digested using 
an appropriate restriction enzyme and electrpphoresed on a 0.8% 
Agarose "^1. The gel was stained with ethidium bromide, photo- 
graphed, denatured, neutralized, and'transferred to Hybond (Amer- 
sham, Arlington Hdghts, IL). After treatment of the filter with 
ultraviolet light, hybridization was performed. The filter was washed 
to a final stringency of 0.2% saturated sodium citrate (SSC) and 
0.1% sodium lauryly sulfate (SDS) and exposed to film. The human 
Jh probe has been previously reported.* 
Genomic library. The genomic library was made using pub- 
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lisbed methods * Approximately 100 of high mol wt genomic 
DNA were partially digested with the Sau3A restriction enzyme. 
Fragments from 9 to 23 kUobases (kb) m size were isolated on a 
sucrose gradient and ligatcd into phage EMBU A (Stratcgenc, San 
Diego). Recombinant phage were packaged, plated, and screened as 
previously reported * 

DNA sequencing. Fragments for sequencing were cloned into 
M13 vectors and sequenced by the chain termination method using 
Sequenase (United States Biochemical, Cleveland),' All sequence 
data were derived from both strands. 

RESULTS 

We studied a bone marrow sample from a patient with 
ALL and associated peripheral eoslnophilia. Karyotypic 
analysis showed the characteirisUc t(5;I4)(q3l;q32) translo- 
cation. These features define a distinctive subtype of ALL.'^ 
The leukemic cells were analyzed for cell surface phenotypc 
by immunofiuorecence. They were positive for 81 (CD20) 
B4 (CD19). cALLA (CDIO), HLA-DR, and terminii 
dcoxynucleotidyl transferase (Tdt), but negative for surface 
immunoglobulin. This phenotypic profile describes an imma- 
ture cell from the B-Iymphocytic lineage.' 

The leukemia DNA was analyzed by Southern blotting for 
rearrangements of the IgH gene. Using a human immuno- 
globulin Jh probe, a single rearranged band was detected by 
EcoKl, Hindm, Sstl, 5a«3A, and EcoKi plus Hindlll 
restriction digests, suggesting rearrangement of one allele 
(Fig 1). The immunoglobulin Jh region from the other allele 
was presumably either deleted or in the germline configura- 
(ion. 

We hypothesized that the t(5;14)(q31;q32) juxtaposed a 
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growth-promoting gene on chromosome 5 with the immuno- 
globulin Jh region on chromosome 14. Therefore, a genomic 
library was made from the leukemic sample and screened 
with a Jh probe. Fifteen distinct positive clones were isolated 
and screened for the presence of the rearranged Sau3A 
fragment that was detected by DNA blotting. By this 
analysis, five clones appeared to represent the rearranged 
allele identified by DNA blots. One of these clones (clone no. 
4) was chosen for further study and a detailed restriction 
map was generated. The EcoRl, Hindlll/ BcoRl, and Sstl 
fragments from clone no. 4 thai hybridized to the human Jh 
probe were also identical in size to the rearranged fragments 
from the leukemia sample,, confirming that clone no. 4 
represented the rearranged leukemic allele. 

Phage clone no. 4 contauied 3.7 kb of unknown origin 
joined to the IgH gene in the region of Jh4 (Fig 2). The IgH 
gene from Jh4 to the Cmu region appeared to be in germline 
configuration. Previously, the gene encoding hematopoietic'' 
growth factor 11^3 had been mapped to chromosome 5q3 1 so 
it was suspected that clone no. 4 might contain part of this 
gene.*-" When the restriction map of human IL-3 and clone 
no. 4 were compared, they were identical for more than 3 kb 
(Fig2). 

We confirmed the juxtaposition of the 11^3 gene and the 
IgH gene by nucleic acid sequencing of the subcloned 
Bsmi/Hpal fragment (Fig 2). The sequence of this frag- 
ment showed no disruption of the protein coding region or the 
messenger RN A of the IL-3 gene. The break in the 11^3 gene 
occurred in the promoter region. 452 base pairs (bp) 
upstream of the transcriptional start site (position 64, Fig 
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The break in the IgH gene occurred 2 bp upstream of 
the^ Jti^ region. Between the two breaks, 25 bp of uncertain 
origin (putative N sequence) were inserted."'" No sequences 
horwlogous to the immunoglobulin faeptamer and nonamer 
Gois^Id be identified in the IL-3 sequence (Fig 3B). Therefore, 
mcslw hind sequencing confirmed the juxtaposition of the 
IL^3 gene and the IgH gene. The sequence data clearly 
shoved that the genes were positioned in opposite transcrip- 
tional orientations (head-to-head). 

yWailable data also allowed us to determine the normal 
positions of the IL-3 gene and the GM-CSF gene in relation 
to -the centromere of chromosome 5 (Fig 4). The IgH gene is 
knovv^n to be positioned with the variable regions toward the 
tdlcnnereon chromosome 14q.'*" It has also been shown that 



2083 

GM-CSF maps within 9 kb of IL-3 in the same transcrip- 
tional orientation.*^ Usmg this information and assunung a 
simple translocation event in our sample, we can conclude 
that the IL-3 gene is normally more centromcric, and the 
GM'CSF gene more telomeric on chromosome 5q (Fig 4), 
Furthermore, both arc transciibed with their 5' ends toward 
the centromere. 

DISCUSSION 

In this report wc have cloned a unique chromosomal 
translocation that appears to be a consistent feature of a rare, 
yet distinct, clinical form of acute leukemia. This transloca- 
tion joined the promoter of the IL-3 gene to the IgH gene. 
Except for the altered promotor, the IL-3 gene appeared 



• • • • 

5 ' G<SrGACCAG6GTTCCCTGGCCCCAGTAGTCAAAGTAGTAGAGCTAATTCATCATAGCTGCGGATT^ 
3 » CCACTGGTCCCAAGGGACCGGGGTCAMAGTTTCATC ATCTCCATTAAGTAGTATCGACGCCTM^ 

5 'TACCAGACAAACTCTCATCTCTTCCAGTGGCCTCCTGGCCACCCACCAGGACC^ Igg 
3'ATGGTCT6TXTGAGAGTAGACAAGGTCACCGGAGGACCGGTGGGTGGTCCTGGTTCGTCCCGCCCGTCGTCTC^ 

^ ********* » • • 

5 'GTAGTCCAGGTGATCGCAGATGAGATCCCACTGGGCy^GGAGGCCTCAGTGAGCTGAGTCAGGCTTC ,40 
3 » C ATCAGGTCCACTACCGTCTACTCTAGGGTCACCCGTCCTCCGGAGTCACTCGACTCAGTC 

• . • • 

5 • GGGGTCCTCTCACCTGCTGCCATGCTTCCCATCTCTCATCCTCCOTOACAAGATG^^ 320 
3 » CCCCAGGAGAGTGGACGACGGTACGAAGGGTAGAGAGTAGGAGGAACTGTTCTACTTCACTATC^ 

********* 

• • • • 

5 ' TTTCTTCTTTC ACTGATCOTGAGTACTAGAAAGTCATGGATGAATAATTACGTCTGTGCiTOT ^ g g 

3 ' AAAGAACAAAGTGACTAGAACTCATGATCTOTCAGTACCTAC WATTAATGCAGACACCAAAAGAT^^ 



5 • C AGATAAAGATCCTTCCGACGCCTGCCCCACACCACCACCTCCCCCCGCCTTGCCCGGGG^ -op 
3 ' 6TCTATTTCTAGGAAGGCTGCGGACGGGGTGTGGTGGTGGAGGGGGGCGGAACGGGCCCCAAC ACCCGTGGAACGACG AC 



5 • CACATar&AGGCGGGAGGTTGTTGCCAACrcTTCAGAGCCCCACGAAGGACCAGAAC^ c/; ^ 
3 ' GTGTATATTCCGCCCTCCAACAACGGTTGAGAAGTCTCGGGGTGCTTCCTGGTCTTGTTCTGTCTC 

5 ' CCAAACATGAGCCGCCTGCCCGTCCTGCTCCTGCTCCAACTCCTGGTCCGCCCi^^^ g ^ C 

3 • GGTTTGTACTC6GCGGACGGGCAGGACGAGGACGAGGTTGAGGACCAGGCGGGGCCT6AGGTTCGAGGGTACTCGGTC ' 



S'AACGTCCTTGAAGACAAGCTGGGTTAAC 3' ggg 
3"TTGCAGGAACTTCTGTTCGACCCAATTG 5' 

B5 • TGGCCCCAGTAGTCAAAGTAGTCACATTGTGGGAGGCCCCATTAAGGGGTGCAC AAAAACCTGACTCTC 
^9 Jn4 3 , ACCGGGGTCATCAGTTTCATCAGTGTAACA CCCTCCGGGGTAATTCCCCACGTGTTTTTGGACTGAGAG 

++++++++++++++++++++++ 
, ^ ^ 5 ' TGGCCCCAGTAGTCAAAGTAGTAGAGGTAATTCATCATAGCTGCGGATTAGCAGCGTGACCGGCTACCA 
• * ^ 3 • Arrrsr^rtCTrATrAfiTTTrRTC ATCTCCATTAA GTAGTATCGACGCCTAATCGTCGCACTGGCCGATGGT 

++++++++++++++++++++++ 

5 • GGCACCAAGAGATGTGCTTCTCAGAGCCTGAGGCTGAACGTGGATGTTTAGCAGCGTGACCGGCTACCA 
^^"^ 3 • CCGTGGrrCTCTACACGAAGAGTCTCGGACTCCGACTTGCACCTACAAATCGTCGC ACTGGCCGATGGT 

Fig 3. Sequence of t(5;14)(q31;q3i) breakpoint region. (A) Nucleotide sequence of the BsmWpel fragment indicated on Fig 2. 
Nucleotides 1 to 36 represent the Jh4 coding region underlined on the coding strand.' Nucleotides 39 to 63 ere a putative N region. The 
sequence from position 64 to 668 U that of the germllne iL-3 gene.** The 11-3 TATA box I485h transcription start C51B>, and Initiation 
methionine (567) are underlined. Two proposed regulatory sequences In the promoter are marKed by asterisks (positions 182 and 389). (B) 
Comparoth^e sequence of the tl5;14)lq31;q32) breaicpolnt region. The lgJh4 region is shown with hs coding region, heptamer, and 
nonamer underlined. Clone no. 4 Is shown with putatWe N region sequences underlined. The IL-3 sequence Is also shown. A plus sign (+) 
denotes the Identical nucleotide between sequences* No heptamer or nonamer Is identified in the IW sequence. 
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Fig 4. Diagram of tho translocation. Tha normat chromosome 
5q31 i9 shown with tho GM-CSF gene talomerio to tho IL-3 gone in 
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14q32 tho Vh roslons are tolomeric. Tho t(5;14J(q31;q32} translo- 
cation results in the head-to-head orientation of these genes. 
Symbols are defined in Rg 2. BP, breakpoint position. 



intact as no deletions, insertions, or point mutations were 
detected by restriction mapping of the entire gene and 
sequencing of part of the gene. The IgH gene has been 
truncated at the Jh4 region, which places the immunoglobu- 
lin enhancer within 2.5 kb of the IL-3 gene."-" This leads to 
the hypothesis that the enhancer is increasing transcription 
of a structurally normal IL-3 gene. The same mechanism is 
miporUnt for activation of the c-myc gene in some cases of 
Byrkitfs lymphoma." An alternate hypothesis is that the 
ehmmation of an upstream IL-3 promoter element is crucial 
to the activation of the IL-3 gene. 
The proposed activation of the IL-S gene stiggests that an 
iniportant for the pathogenesis of this 
leukemia.^ Over-expression of the II.3 gene coupled with 
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the presence of the IL-3 receptor in these cells could account 
for a strong stimulus for proliferation. In this r^gatd, there 
are data indicating that immature B-lineage lymphocytes 
and B-tineage Icukemias may express the 11^3 receptor.^'-^ 

An additional feature of this type of leukemia is the 
dramatic eosinophilia, consisting of mature forms. It has 
been hypothesized that the eosinophils do not arise from the 
malignant clone, but are stimulated by the tumor 
Because of the known effect of IL-3 on eosinophil differen tia- 
tion, secretion of high levels of IL-3 by leukemic cells might 
hav€i a role in the cosinophilia in this type of leukemia. 

The data suggest that the recombination mechanism that 
is active in the IgH gene during normal differentiation has a 
role in this translocation.^*-" This is supported by the break- 
point location at the 5' end of Jh4 and the presence of 
putative N-rcgion sequences. On the other hand, no recombi- 
nation signal sequence (heptamer and nonamer) was found 
in this region on chromosome 5, suggesting that additional 
factors also played a role. Further studies will elucidate the 
mechanism of this and other translocations.. 

In the leukemia we studied, it Is possible that the immuno- 
globulin enhancer also activates the GM-CSF gene, since 
this gene is probably positioned only 14 kb away (Fig 4). This 
is known to be within the range of enhancer activation." The 
intcrleukin-5 (IL-5) gene maps to chromosome 5q3i.» 
Deregulation of the IL-5 gene by this translocation would act 
synergistically with TL-3 in the stimulation of eosinophil 
proliferation and differentiation." These and other questions 
will be answered by the study of more patient samples; We 
plan to determine whether the t(5;l4)(q3l;q32) transloca- 
tion is capable of acthrating multiple lymphoWncs simulta- 
neously and whettier they cooperate in the generation of this 
leukemia. 
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RAPID COMMUNICATION 

Activation of the Interieukiii-3 Gene by Chromosome Translocation in Acute 
Lymphocytic Leukemia With Eosinophilia 

By Timothy C. Meeker, Dan Hardy, Cheryl Wlllman, Thomas Hogan, and John Abrams 



The t(6n4)C<t31xi32) translocation from B-imeage acute 
lirmphooytle leukemia with eosinophilia has been cloned 
from two leukemia samples. In both, cases, this transloca- 
tten Joined the IgH gene and the interleukin-S CIL-3) gene. In 
one patient, excess IL-3 mRNA was produced by the 
leukemic cells. In the second patient, serum IL-3 levels 
were measured and shown to correlate whh disease 

A NUMBER OF chromosome translocations have been 
associated with human leukemia and lymphoma. In 
many cases the study of these translocations has led to the 
discovery or characterization of proto-oncogcnes, such as 
bcl-2, Qrabl, and omycy that are - located adjacent to the 
translocation,'*^ It is now widely understood that cancer- 
associated translocations disrupt nearby proto-oncogenes. 

A distinct subtype of acute leukemia is characterized by 
the triad of B-lineage inununophenotype, eosinophilia, and 
the t{5;14)(q3l;q32) translocaUon.^*^ Leukemic cells from 
such patients have been positive for terminal deoxynucleotidyl 
transferase (Tdt), common acute lymphoblastic leukemia 
antigen (CALLA), and CD19» but negative for surface or 
cytoplasmic immunoglobulin. In previous work, we cloned 
the t(5;14) breakpoint from one leukemic sample (Case 1) 
and determined that the IgH and interleukin-3 (11^3) genes 
were joined by this abnormality.* In this report, we extend 
those findings by showing that the t(5;14)(q31;q32) translo- 
cation from a second leukemia sample (C^ 2) has a similar 
structure, and we report our study of growth factor expres- 
sion in these patients. 

MATERIALS AND METHODS 
Samples and Southern blots. Case 1 has been described.^-^ 
Clinical features of Case 2 have been <|escribed m detail.^ DNA 
isolation and Southern blotting was done using previously d^cribed 
methods.^ Filten? were bybridized with an unmunoglobulm Jh probe, 
a 280 bp BamHl/Bcom genon^c fragment, and an IL-3 
cDNAprobe.^* 

Northern blots, RNA isolation and Northern blotting have been 
described.* Briefly, Northern blots were done by separating 9fi$ 
total RNA on 1% agarose-fonnaldehyde gels. Equal RNA loading in 
each lane was confirmed by ethidium bromide staining. Blots were 
hybridized with an IL-3 cDNA probe extending to the Mo I site in 
exon 5. a 720 hpSst 1/Kpn 1 probe derived from intron 2 of the IL-3 
gene, a 600 bp Nh< l/Hpa I IL-5 cDNA probe, and a 500 bp Pst 
l/Nco I granulocyte-macrophage colony stimulating factor (GM- 
CSF)cDNA probe.'^»2 

Polymerase chain reaction. Primers were designed with BamHl 
sites for cloning. One primer hybridized to the Jh sequences from the 
IgH gene (Primer 144:5'-TAGGATCCGACGGTGACCAGGGT), 
and the other hybridized to the region of the TATA box in the IL-3 
gene (Primer 161: S'-AACAGGATCCCCKXTTATATCTGCAG), 
Polymerase chain reaction (PGR) (95*»C for I ndnute. 61^ for 30 
seconds, and 72°C for 3 minutes) was done ushig 500 og genomic 
DNA and 50 pmol of each primer in 100 mL contaming 67 mmol/L 
Tris-HCl pH 8,8. 6.7 mmol/L Mga,, 10% dusethyl sulfoxide 
(DMSO). 170 Mg/mL bovine serum albumm (BSA) (fraction V), 



activity. There was no evidenco of excess granulocyte/ 
macrophage colony stimulating factor <GM-CSF) or lUB 
expression. Our data support the formulation that this 
subtype of leukemia may arise in part because of a 
chromosome translocation that activates the iL-3 gene, 
resulting in autocrine and paracrine growth effects. 
© 1990 by The American Society of Hematology, 

16.6 mmol/L ammonium sulfate, 1.5 mmol/L eadi dNTP and Taq 
polymerase (Perkui-Elmer, Norwalk, CT) J' 

Sequencing, Sequencing was done by chain termination in M 1 3 
vectors," As part of this study, we sequenced a subclone of a normal 
IL-3 promoter, covering 598 base pairs from a Sma I site at position 
-1240 (with respect to the proposed sitg'of transcription initiation) 
to an Nhe I site at position -642. The plasmid containing this region 
was a gift from Naoko Arai of the DNAX Research Institute. 

Expression in Cos? cells, A genomic 11^3 fragment from Case 1 
was cloned into the pXM expression vector.*'^ Briefly, the Hindlli/ 
Sal J fragment containmg the IL-3 gene was subcloned from the 
previously described phage clone 4 into pUClS.^ The 2.6 kb 
fragment extending from the Sma I site 61 bp upstream of the IL-3 
transcription start to the Sma I site in the polyUnker was cloned into 
the blunted Xho I site of pXM. The negative control construct was 
the pXM vector without insert Plasmids were introduced mto 0>s7 
ceUs by electroporation, and supernatant was collected after 48 
hours in culture. 

TFl bioassay. TF-1 cells were passaged in RPMI 1640 supple- 
mented with 10% hcat^inactivated fetal bovine scrum, 2 mmol 
L-glutamine,and 1 ng/mL human GM-CSF.'^ Samples and antibod- 
ies were diluted in this same medium lacking GM-CSF but contain- 
ing penicillin and streptomydn. A 25 fiL volume of serial dilutions of 
patient serum was added to wells in a flat bottom 96-weIl microtiter 
plate. Rat anti-cytokine monoclonal antibody m a volume of 25 pL 
was added to appropriate wells and preincubated for 1 hour at 37*^. 
Fifty microliters of tmce washed TF-1 cells were added to each well, 
giving a final cell concentration Of 1 x. 10* cells per well, (final 
.volume. 100 pL). the plate was incubated for 48 honrs^ The 
remainuig cell viablUty was determined metaboUcally by the oolori- 
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Breakpoint saquonces for Cbso 2. The oarmllne IgJhS region sequence (protoln codino rBaia*. ..-^^ ..i - 
6ec,«encaa.rettnderii„ed)l.ontop,th IflnaJ 
m the middle, and the garmllne IM aequence, which we derived Irom . normal IL^ tl^TontU^^^^ und«r«ned) 
sequence has the same nucleotide. The eequence documenta the head-to^ead Joining of the 11-3 and moVi^hab^^^. . ***** 
gene occurred at position -934 (•). «» ana ign genes. The breakpoint la the IL^ 



metric method of Mosmann usbg a VMax microtitcr plate reader 
(Molecular Devices, Menio Park. CA) set at 570 and 650 nm. 

Cytokine Immunoassays, These assays used rat monoclonal 
anti-cytoldne antibodies (10 Mg/mL) to coat the wclb of a PVC 
miciotUcr pjatc. The capture, antibodies us«I were BVD3-6G8, 
JES1.39D10, and BVD2-23B6, for the BUS, and GM-CSF 
assays, respectively. Patient sera were then added (undUuted and 
diluted 1:2 for 11^3, undUuted for IL-5. and undiluted and diluted 
1:5 for GM-CSF). The detecting immunoreagente used were either 
mouse antiserum to IL-3 or niUoiodophenyl (NIP)-dcrivatized rat 
monoclonal anUbodies JES1-5A2 and BVD2-21C11, specific for 
11^5 and GM-CSF, respectively. Bound antibody was subsequently 
detected with iromunoperoxidase conjugates: horseradish peroxidase 
(HRP)^beled goat anti-mouse Ig for IU3. or HRP-Iabclcd rat (J4 
MoAb) anti-NIP for IL-5 and GM-CSF. The chromogenic sub- 
strate was 3-3'azino-bis-ben«hia2oline sulfonate (ABTS; Sigma. St 
Louis, MO). Unknown values were interpolated from standard 
curves prepared from dilutions of the recombinant factors using 
Softmax software available with the VMAX microplate reader 
(Molecular Devices). 

RESULTS 

Leukemic DNA from Case 2 was studied by Southern 
blotting. When digested with the HindUl restriction enzyme 
and hybridized with a human immunoglobulin heavy chain 
joining region (Jh) probe, a rearranged fragment at approxi- 
, matcly 14 kb was detected (data not shown). When reprobcd 
with either of two different IL-3 probes, a rearranged 14 kb 



#2 



#1 



hi 



fragment, comigrating with therearraiigedJli fiagment, was 
ident^ed- Whea leukemic DNA was digested with ^rindin 
ptas &»W. a reairanged Jh fragment was detected at 6 fcb 
Tie IL.3 probes also identified a comigrating fragment of 
this size. These experiments Indicated that the leukemic 
sample studied was clonal and that a single fragment 
contamed both Jh and lL-3 sequences, suggesting a tiSlo- 
cation had occurred. 

To charactwizc better the joining of the IL-3 gene and the 
immunoglobulin heavy chain (IgH) gene, the polymerase 
Cham leacuon (PCR) was used to clone the translocation « 
A Jh pnmer and an 11,3 primer were designed to produce an 
ampUfied product in the event of a head-to-head transloca- 

Uon WhUecoutrolDNAgaveno PGR product, Case2 DNA 
yielded a PCR-derived fragment of approximately 980 bp 
which was cloned and sequenced. 

•ne DNA sequence of the translocation clone from Case 2 
confinned the joining of the Jh region with the promoter of 
tne u^3 gene m a head-to^head configuraUon (Fia 1) 
Sequence analysis indicated that the breakpoint on chromo^ 
some 14 was just upsUeam of the Jh5 coding region The 
breakpoint on chromosome 5 occurred 934 bp upstream of 
ttie putative site of transcription initiation of the IL-3 gene 
We also determined that a putative N sequence of 1 7 bp was 
mserted between thTchr^mosome 5 and chromosome 14 
sequences during the translocation event." '* Figure 2 shows 
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FIfl Rslatlonship of chromosome B breakpoints to the IL-3 gms. Thb fiaure shows the two ctoned braaferninM i^^^^x ■ 
the normal IL-3 gsne."' One breakpoint occurred at position -462 and the other at -^34 (o^f^t^* m ' 
translocations resulted in a head-to4tead Joining of the IgH gene and the IL-3 gene, leaving the mRNA a jDrotafai Ha.!™ 
gen. Boxes denote the five 0.-3 axons: rLtrletion <i^s are (B) B^U <P) Pstl. (H) ISp- UE?Kw5to Atel 
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-»SlLiS2T^!?l!!? . u a Northern blot was preparod and hybridized with a probe for IL^. Une 1 

m^Z^ T^^T^I^A ^ ^^:^:J":f, f T***.*""^ "^"^ ^^"^ ^mu^^ with ConA for 48 houre. As In the po.lZ contr^^ 

^^^^I^Xf^!"' ""^i; civerwixpre«»lon of an unspllced 2.9 kb IL^ transcript (tone 4, upper «rrowK Wo 

i^^TnC 2 i^^^^^ r •"'^^ ^ ^ band hybiiJized to rprobe 

2^^aWe in n^m^^ " TV ^.f ^'^rl'^ ^ »«09ostin9 that a» IL-3 mRNA of this size U som^^es 

Analyzed by Northern blot because too few ceils were availabte for study. whw ^ couki not oe 



<he locations of the two cloned breakpoints in relation to the 
JLr-3 gene. Hie two chromosoine 5 brealqxnnts were sepa- 
rated by less than SOO bp. 

The genomic structure in Cases 1 and 2 suggested that a 
mormal IL-3 gene product was over-expressed as a result of 
the altered promotor structure. This would predict that the 
IL-3 gene on the translocated chromosome was capable of 
making IL-3 protein. This prediction was tested by express- 
ing a genomic fragment from the translocated allele of Case 
1 containmg all five IL-3 exons under the control of the S V40 
fMTomotor/enhancer in the Cos? cell line. Cell supematants 
vrere studied in a proliferation assay using the factor depen- 
dent crythroleukemic cell line, TF-1. The supematants' 
derived from transfections using the vector plus insert 
supported TF-I proliferation, while supernatants from trans- 
fections using the vector alone were negative in this assay 
<data not shovQa). Furthermore, the biologic activity could be 
blocked by an antibody to human IL-3 (BVD3-6G8). This 
result showed that the translocated allele retained the ability 
to make IL-3 mRNA and protein. 

The level of expression of IL-3 mRNA in leukemic cells 
from Case 1 was assessed. Northern blotting showed that the 
mature IL-3 mRNA (approximately 1 kb) and a 2,9 kb 
unspliced IL-3 mRNA were excessively produced by the 
leukemia (Fig 3). The 2.9 kb form of the mRNA is also 
present at low levels in normal peripheral blood T lympho- 
cytes after mitogen activation (Fig 3). Several B-lineage 
acute leukemia samples without the t(5;14) translocation 
had undetectable IcveU of IL-3 mRNA in these experiments. 
In addition, although genes for GM-CSF and IL-5 map close 
to the 11^3 gene and might have been deregulated by the 
translocation, no IL-S or GM-CSF mRNA could be detected 
in the leukemic sample (data not shown)."'*" 

Three serum samples from Case 2 were assayed by 
immunoassay for levels of IL-3. GM-CSF, and IL-5 (Table 
1). Serum lL-3 could be detected and correlated with the 
clinical course. When the patient's leukemic cell burden was 



highest, the IL-3 level was highest. No serum GM-CSF or 
IL-S could be detected. 

Since the IL-3 immunoassay measured only immunoreac- 
tive factor, we confimed that bioIogicaJly active IL-3 was 
present by using the TF-1 bioassay. This bioassay can be 
rendered monospecific using appropriate neutralizing mono- 
clonal antibodies specitic for IL-3, IL-5, or GM-CSF. We 
observed that sera from 1-16-84 and 3-14-84 contained TF-1 
stimulating activity that could be blocked with anti-IL-3 
MoAb (BVD3-6G8), but not with MoAbs to IL-5 (JESl- 
39D10) or GM-CSF (BVD2.23B6) (Fig 4; GM-CSF data 
not shown). The amount of neutralizable bioactivity in these 
two samples correlated very well with the difference in IL-3 
levels obtained by immunoassay for these samples. Further- 
more, the failure to block TF-1 proliferating activity with 
either anti-IL-5 or anti-GM;CSF^was consistent with the 
inability to measure theie" f actoii by immunoassay .and 



Table 1. Peripheral Blood Counts and Growth Factor Levels 
at Differont Times hi Case 2 



Samplo Date 





11/16/83 


1/16/84 


3/14/84 


Peripheral blood counts (cells/fiL) 








WBC 


81,800 


116,600 


12.300 


Lymphoblssts 


0 


33,786 


0 


Eosinophils 


48,626 


73,080 


616 


Serum growth factor levels (pg/mL) 








IL-3 


■ <444 


7.995 


1,051 


GM-CSF 


<16 


<16 


<15 


IL-5 


<50 


<60 


<60 



_ _ w» www Wlim VKl UIII9 pwillMi WIU1 

the correspondino growth factor levels quantified by Immunoassay. The 
patient received chemotherapy between 1/16/84 and 3/14/84 to lower 
his leukemic burden.' No serum samples were available for a simflar 
anelyslsofCase 1. 
Abbreviation: WBC, white blood cells. 
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vertical axis. Serum frim, an three time pplnta was Msay^ 
concentration of mono<Ho„al ret antWL^, BVDmS^^^ 

inhibition of proliferation was evidem ''^^^e^ncVof «^^^ 7 ^^.f ^» ^ ^.^^^^ and 3/14/84, 

was not detected In this assay, as antWL-B dW not after Xf^Ur^^^ documentlnfl serunf levels of IL-3 on thoeo days. SBrum H^ 



indicated that these other myeloid growth factors were not 
detcctably drcalating in the scrum of this patient. 

DISCUSSrON 

In this report, we have extended our analysis of acute 
lymphocytic leukemia and cosinophilia associated with the 
t(5; 14) translocation. In both cases we have studied, we have 
documented the joining of the IL-3 gene from chromosome 5 
to the IgH gene from chromosome 14. The breakpoints on 
chromosome 5 are within 500 bp of each other, suggesting 
that additional breakpoints will be clustered in a small region 
of the IL-3 promoter. The PGR assay we have developed will 
be useful in the screening of additional cliuical samples for 
this abnormality. 

The finding of a disrupted 11^3 promotor associated with 
an otherwise normal IL-3 gene implied that this transloca* 
tion might lead to the overTCxprcssion of a normal IL-3 gene 
product. In this work, we have documented that this is true 
In addition, neither GM-CSF nor f L-5 areVer-expressed by 
the leukemic cells. Furthermore, in one patient, serum IL-3 
could be measured and correlated with disease activity. To 
our knowledge, this is the first measurement of human IL-3 
in serum and its association with a disease process. The 
measurement of serum IL-3 m this and other clinical settings 
may now be indicated. 



The finding of the IL.3 gene adjacent to a cancer- 
associated translocation breakpoint suggests that its activa- 
tion is important for oncogenesis. It is our thesis that an 
autocrine loop for IL-3 is important for the evolution of this 
leukemia. The excessive IL-3 production that we have 
documented would be one feature of such an automae loop 
The final proof of our thesis must await additional data In 
particular, from the study of additional clinical samples, it 
will be necessary to document that the IL-3 receptor is 
present on' the leukemic cells and that ana-lL.3 antibody 
decreases proliferation of the leukemia in vitro. 

An important aspect of this work is the suggestion of a 
therapeuticapproach for this disease. If an autocrine loop for 
IL-3 can be documented in this disease, attempts to lower 
circulatmg IL-3 levels or block the interacUon of IL-3 with 
Its receptor may prove useful. Because it is also possible that 
the eosinophilia in these patients is mediated by the para- 
crme efi-ects of leukemia-deny^ nt^3, simihir mterventions 
may improve this aspect of the diseai. Antibodies or '' 
engmeered ligands to accomplish these goals may soon be 
available. 
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Clinical and Pathologic Significance of the 
c-erdB-2 {HER-2/neu) Oncogene 

Timothy R Singleton and John G. StricWer 



Hie c-^&B-2 oncogene was first shovm to have clinica] significance in 1987 by 
Slamon et si,'^ who reported that c^erbB-Z DNA amplification in breast carcino- 
njas correlated with decreased survival in patients with metastasis to axillary 
Tymph nodes. Subsequent studies, however, of c-criB-2 activation in breast 
carcinoma reached conflicting conclusions about its clinical significance. This 
oncogene also has been reported to have clinical and pathologic implications in 
other neoplasms. Our review summarizes these various studies and examines 
the clinical relevance of c-eriB-2 activaUon, which has not been emphasized in 
recent reviews. a^M,65 fhe molecular biology of the c-er6B-2 oncogene has been 
extensively reviewed^^-"'® and will be discussed only bnefly here. 

■ -^i 

BACKGROUND ( 

j 

The Q-erbB^i oncogene was discovered in the 1980s by three lines of investlga- f 
tion. The neu (mcogene was detected as a mutated transforming gene in \ 
neuroblastomas induced by ethylnitrosurea treatment of fetal rats.^^'^^'HTO -Jhe 
cr6B-2 was a human gene discovered by its homology to the retroviral gene v- 
€r&B.».<9,Y6 HER'2 was isolated by screem'ng a human genomic DNA library for 
homology with v-erbB.^ When the DNA sequences were determined subse- 
quently, c-er6B-2, and neu were found to represent the same gene. 
Recently, the c-er&B-2 oncogene also has been referred to as NGL, 

The c-erfcB-2 DNA is located on human chromosome rtqZl^^^^ and codes 
for c-crfcB-2 mENA (4.6 kb), which translates c-er6B-2 protein (pl85). This 
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protein is a norma] component of cytoplasmic membranes. The c-eriB-2 
oncogene is homologous with, but not identical to, c-crfeB-1, which is located 
on chromosome 7 and codes for the epidermal growth factor rcccptor.*'i*»^Thc c- 
er&B-2 protein is a receptor on cell membranes and has intracellular tyrosine 
kinase activity and an extracellular binding domain.^*** Electron microscopy 
, with a polyclonal antibody detects o-€r&B-2 inununoreactivity on cytoplasmic 

membranes of neoplasms^ especially on microvilli and the non-villous outer cell 
membrane.^ In normal cells, immunohistochemical reactivity for C'erbB-2 is 
frequently present at the basolateral membrane or the cytoplasmic membriane's 
brush border. 28.62 

There is cxpeiimental evidence that fyerbB'2 protein may be involved in 
the pathogenesis of breast neoplasia. Overproduction of otherwise normal c- 
erbB-2 protein can transform a cell line into a malignant phenotype,^ Also, 
when the neu oncogene containihg^'ari activating point mutation is placed in 
transgenic mice with a strong promoter for increased expression, the mice 
develop multiple independent mammary adenocafcinomas.^^'^ In other experi- 
^0 ]; ments, monoclonal antibodies against die neu protein inhibit the growth (in 

nude mice) of a neu-transformed cell line,"^" and. immunization of mico with 
neu protein protects them from subsequent tumor challenge with the neu* 
transformed cell llne.^^ Some authors have speculated that the use of antago- 
tfi'} nists for the unknown ligand could bo useful in future chemotherapy.^ Further 

!| i^j V review of this experimental evidence is beyond the scope of this artidie. 

The c-«rfcB-2 activation most likely occurs at an early stage of neoplastic 
(il- _ development This hypothesis is supported by the presence of c^6B-2 activa- 

'1 -..! Hon in both (n situ and invasive breast carcinomas. In addition, studies of 

'•l i . metastatic breast carcinomas usually demonstrate uniform o-erfeB-2 activation 

it J ^ at multiple sites in the same patient, although o^bB-2 activation has 

;i? I rarely been detected in metastatic lesions but not in the primary tumor;*^*®^*^^ 

|.< i j Even more rarely, c-er&B-2 DNA amplification has been detected in a primary 

if; i' .! breast carcinoma but not in its lymph node metastasis.^ In patients who have 

bilateral breast neoplasms, both lesions have similar patterns o{i>erbB-'2 activa- 
tioii, but only a few such cases have been studied.^^ 



MECHANISMS OF ACTIVATION 



The most common mechanism of c-erfcB-2 activation is genomic DNA amplifica- 
tion, which ahnost always results Id overproduction of c-erbB'*2 raRNA and 
protein. ".a4.65,Bi c-erfeB-2 amplification may stabilize the overproduction of 
jj ! mKNA or' protein through unknown mechanisms. Human breast carcinomas 

with c-«r&B-2 amplification contain 2 to 40 times more c-erbB-2 DNA**^ and 4 to 
i • 128 times more c-0rbB-2 rnRNA^*^ than found in normal tissue. Most human 

i| !j ' breast carcinomas with o-erb'B-2 amplification have 2 to 15 times more oerbB'2 

;| [i ; DNA. Tbraors with greater amplification tend to have greater overproduc- 

jj:?'* • tion."'®'^ The non-mammary neoplasms that have been studied tend to have 

I ;! . 

7 I" 

"^ii ■ , ' 
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similar levels of c-er&B*2 amplification or overproduction relative to the eorre' 
sponding normal tissue. 

The second most common mechanism of c-er&B-2 activation is overproduce 
tion of c^r&B*2 mRNA and protein without amplification of c-0r27B-2 DNA.^^ 
The quantities of mKNA and protein usually are less than those in amplified 
cases and may approach the small quantities present in normal hreast or other 
tissues.iT'K^** The c-erfeB-2 protein overproduction without mRNA overproduc- 
tion or DNA amplification has been described in a few btmian breast carcinoma 
cell lines.^ 

Other rare mechanisms of c-erfcB-2 activation have been reported. Translo- 
cations involving the o-er&B-2 gene have been described in a few mammary and 
gastric carcinomas, although some rex)orted cases may represent restriction 
fi-agment length polymorphisms or incomplete lestricttoii en:^me digestions 
that mimic translocations. a smgle point mutation in the transmem- 
brane portion of neu has been described in rat neuroblastomas induced by 
ethylnitrosureai The mutated neu protein has increased tyrosine kinase activ- 
ity and aggregates at the cell membrane. Alfhdu^ there has been specula- 
tion that some of the amplified c-er&B-2 genes may contain point mutations,^ 
none has been detected in primary human neoplasms.^**^"^ 



TECHNIQUES FOR DETECTING ACTIVATION 
Detection of o-erbB-Z DNA Amplification 

Amplification of c-«rfeB-2 DNA ]s usually detected by DNA dot blot or South- 
em blot hybridization. In the dot blot method, the extracted DNA is placed 
directly on a nylon membrane and hybridized with a c^erbB-Z DNA probe. In 
the Southern blot method, the extracted DNA is treated with a restriction 
enzyme, and the fi'agments are separated by electrophoresis, transferred to a 
nylon membrane, and hybridized with a c-erbB-^ DNA probe. In both tech- 
niques, c<-er&B-2 amplification is quantified by comparing the intensity (Mea- 
sured by densitometry) of the hybridization bands from the sample yrith those 
fi*om control tissue. 

Several technical problems may complicate the measurement of 
DNA amplification. First, the extracted tumor DNA may be excessively de- 
graded or diluted by DNA from stromal cells*" Second, the c-cr6B-2 DNA 
probe must be carefully chosen and labeled. For example, oligonucleotide c- 
erfeB-2 probes may not be sensitive enough for measuring a low level of c-erfeB* 
2 amplification, because diploid copy numbers can be diiBcult to detect (unpub- 
lished data). Third, the total amounts of DNA in the sample and control tissue 
must be compensated for/ often with a probe to an unamplified gene. Many 
studies have used control probes to genes on chromosome 17, the location of t> 
erbB-Z^ to correct for possible alterations in chromosome number. Identical 
results, however, are obtained by using control probes to genes on other chro^ 
mosomes,****'^ with rare exception." Studies using control probes to the beta- 
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globin gene must be intexpreted with cautioBi because one aHele of this gene is 
deleted occasionally in breast carcinomas.^ 

Amplification of C"er&B-2 DNA was assessed by using the polymerase 
chain reaction (PCB) in one recent study,^ Oligoprimers for the iyerbB-2 gene 
and a control gene are added to the sample's DNA, and PGR is performed. If 
the sample contains more copies of c-erbB-Z DNA than of the control gene, the 
c-criB-2 DNA is replicated preferentially. 

Detection of c-erbB^2 mRNA Overproduction 

Overproduction of c-erbB-Z mRNA usually is measured by RNA dot blot or 
Northern blot hybridization. 9oth techniques require extraction of BNA but 
bthervise are analogow to DNA dot blot and Southern blot hybridization. Use 
of PGR for detection of mRNA haCbeen described in two recent 

abstracts.^i<« 

Overproduction of c-erfcB-2 mRNA can be measured by in situ hybridiza- 
tion! Sections are mounted on glass slides, treated with protease, hybridized 
with a radiolabeled probe, washed, treated with nuclease to remove unbound 
probe, and developed for autoradiography Silver grains are seen only over 
tumor cells that overproduce c-6r6B-2 mRNA. Negative control probes are 
uscd.^w* Our experience indicates that these tedmiques are relatively insensi- 
tive for detecting c-erbB-2 mRNA overproduction In routinely processed tisr 
sue. Although the sensitivity may be increased by modifications thai allow 
simultaneous detection of c-BrbB-2 DNA and roRNA, In situ hybridization still 
is cumbersome and expensive (unpublished data). 

AD of the above o-er&B-2 mRNA detection techniques have several prob- 
lems that make them more difficult to perform than techniques for detecting 
DNA amplification. One major problem is the rapid degradation of RNA in 
tissue that is not immediately frozen or fixed. In addition, during the detection 
procedure, RNA can be degraded by RNase; a ubiquitous enzyme, which must 
be eliminated meticulously from laboratory solutions. Third, control probes to 
genes that are uniformly expressed in the tissue of Interest need to be carefully 
selected. ^ " ' . 

Detection of c^rl>B-2 Protein Overproduction 

The most accurate methods for detecting c-erhB'2 protein overproduction are 
the Western blot method and Immunoprecipitation, Both techniques can docu- 
ment the binding specificity of various antibodies against c-er6B-2 protein. In 
Western blot studies, protein is extracted from the tissue, separated by electro- 
phoresis (according to size), transferred to a membrane, and detected by using an- 
tibodies to c-6r&B-2. In immunoprecipitation studies, antibodies against c-er&B- 
2 are added to a tumor lysate, and the resulting protein-antibody precipitate is 
separated by gel electrophoresis and stained ibr protein. Both Western blot and 
immunoprecipitation are useiul research tools but currently are not practical for 
diagnostic pathology. Two recent abstracts have described an enzyme-linked 
immunosorbent assay (ELISA) for detection of c-0rtB-2 protein. 
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ACTIVATION OF IN BREAST LESIONS 

Incidence of o-erbB-2 Activation 

Most studies of C'erbB-2 oncogene activation do not specify hlstologica] sub- 
types of infiltrating breast carcinoma. Amplification of c-^rfeB-2 DNA was found 
in 19.1 percent (519 of 271^ of invasive carcinomas in 25 studies ClUble 1), and 
iyerbB'2 mRNA or protein overproduction was detected in 20.9 percent (566 of 
2714) of invasive carcinomas in 20 studies. Twelve studies have documented c- 
^rtB^2 mRNA or protein overproductipn ih 15 percent (88 of 604) of carcinomas 
that lacked c-er2iB-2 DNA amplification^ 

The incidence of c-erl^B^2 activation in infiltrating breast carcinoma varies 
with the histological subtype. Approximately 22 percent (142 of 650) of infiltrat- 
ing ductal carcinomas have c-0r6B-2 activation, as expected from the above 
data. Other variants of breast carcinoma widi frequent c-erbB-2 activation are 
inflammatory carcinoma (62 percent, 54 of 87), Paget's disease (82 percent, 9 of 
11). and medullary carcinoma (22 percent, 5 of 23). In contrast, c-er6B-2 activa- 
tion is infrequent in infiltrating lobular carcinoma (7 percent^ 5 of 73) and 
tubular carcinoma (7 percent, 1 of 15). 

The c*-&r&B*2 protein overproduction is present in 44 percent (44 of 100) of 
ductal carcinomas in situ and especially comedocardnoma in situ (66 percent, 
49 of 72). The raicropapillary type of ductal carcinoma in situ also tends to have 
c-erbB*2 activation,^ "*** especially if larger cells are present, The greater fi-e- 



-1? 



<Hi/frBi2 ONCOGENE 169 

Overproduction of o-er6B-2 protein is most commonly gisscssed by various 
immunobistochemical techniques. These procedures, often generate conflicting 
results, which are explained at least partially by three factors. First, various 
studies have used- different polyclonal and monocbnai antibodies. Because 
some polyclonal antibodies recognize weak bands in addition to the c-erfcB-2 
protein band on Western blot or immunoprecipitation^ the results of these 
studies should be interpreted with caution.'^'^^'^^ Even some monoclonal anti- 
bodies immunoprecipitate protein bands in addition to c-erfcB-2 (pl85).^'«*^« | 
Second, tissue fixation contributes to variability between studies. Ebr example, i 
some antibodies detect c-erbB-Z protein only in fi^ozen tissue and do not react 
in fixed tissue. In general, formalin fixation diminishes the sensitivity of 
iromunohistochemlcal methods and decreases the number of reactive cells*"^'^ 
When Bouin's fixative is used, there may ♦be a hij^r^percejitage of positive 
cases,^ Tlnrd, minimal criteria for interpreting immunohistochemical staining . 
are generally lacking. Althou^ there is general agreement that distinct crisp 
cytoplasmic membrane staining is diagnostic for c-eriB-2 activation in breast 
. carcinoma, the number of positive cells and the staining intensity required to 
diagnose c-eriB-2 protein overproduction varies from study to study and from 
antibody to antibody. Degradation of C'^tB-2 protein is not a problem because 
it can be detected in intact form more than 24 hours after tumor resection . 
without fixation or fireezing.^ 
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quency of c-erfcB-2 protein overproduction in oomedocarcjnoma in situ, com- 
pared with infiltrating ductal carcinoma, could be explained by the fdct that 
many infiltrating ductal carcinomas arise jfrom other types of intraductal carci- 
noma, which show {yeriB-2 activation infrequently. Others have speculated 
that carcinoma io sitii with activation tends to regress or to lose o 

er6B-2 activation during progression to invasion.-^**'® Infiltrating and in situ 
components of ductal carcinoma, however, usually are similar with respect to o- 
^rbB-2 activation,"«39 although some authors have noted more heterogeneity of 
the immiinohistochemical staining pattern in invasive thah in in situ carci- 
noma Activation of.c-erfcB-2 is infrequent in lobular carcinoma in situ. If 
lesions contain more than one histological pattern of carcinoma in situ, the c- 
erfcB-2 protein overproduction tends to occur in the comedocarcinoma in situ 
but may include other areas of xjarcinoma in situ.*2.M,« Overproduction of c- 
•er&B'2 protein in ductal carcinoma in situ correlates with larger cell size and a 
periductal lymphoid infiltrate.*^ 

Activation of c-erfcB-2 has not been identified in benign breast lesions, 
including fibrocystic disease, fibroadenomas, and radial scars (Table 2). Strong 
membrane immunohistochemical reactivity for o-erbM has not been described 
in atypical ductal hyperplasia, although weak accentuation of membrane staining 
has been noted infrequently. In normal breast tissue, o-erfeB-2 DNA is 
diploid, and c-cr6B-2 is expressed at lower levels than in activated tumors,^® ™ 

These preliminary data suggest that c-erbB-2 activation may not be useful 
for resolving many of the common problems in diagnostic surgical pathology* For 
example, cser&B-2 activation is infi-equent in tubular carcinoma and radial scars. 
In addition, because C'erbB'2 activation is unusual in atypical ductal hyperplasia, 
cribriform carcinoma in situ, and papillary carcinoma in situ, detection of c-erbB- 
2 activation in these lesions may not be helpfid in their differential diagnosis. The 
histological features of comedocarcinoma in situ, which commonly overproduces 
c-crfcB-2, are unlikely to he mistaken for those of benign lesions, ActWation of 

TABLE 2, p^rbB^ ACTVATON IN PEWtCiW HUMAN BREAST LESIONS 



Histological Diagnosis 


e^rl>B*2 DHA 


c^rbB-amRNA 


o^rl»B-2 Protein 


Amplification' 


Overproduction 


Overproduction 


Fibrocystic disease 


0/10^ 






Atypical ductal hyperplasia 






2(weali)y21 « 
1(cytopla8nito}/13» 


Benign ductal hyperplasia 






0/12W 


Sclerosing adenosis 






0M» 


Fibroadenomas 


0/16,«0/6w 


0/6 »0/^ 


0/21 M 0/10 M 
0/6.»0/3« 


Radial scars 






0/22» 


Blunt dud adenosis 








"Breast mastosis" 




0/3* 





•Shown as number of cases wim acUvaUon/numaar at cases siuAod; rstorencs Is given as a superecrIpL 
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c-erfcB-2, however, does favor infiltrating ductal carcinoma over infiltrating 
lobular carcinoma. Further studies of diese issues would be useful. 

Correlation of c-erbB-2 Activation With Pathologic Prognostic Factors 

Multiple studies have attempted to correlate c-er&B-2 activation with various 
patholo^c prognostic factors (Table 3), Activation of c-erbBrS was correlated 
with lymph node metastasis in 8 of 28 series, with higher histological grade in 6 
of 17 series, and with higher stage in 4 of 14 series. Large tumor size was riot 
associated with o^rbB-Z activation in most studies (II of 14). Tetraploid DNA 
content and low proliferation, measured by Ki-67, have been suggested as 
prognostic factors and may correlate with c^&B-2 activation.^'' 

Correlation of c-erdB-2 Activation With Clinical-Prognostic Factors 

Various studies have attempted also to correlate G'erbB-2 activation with dinical 
features that may predict a poor outcome (Table 4). Activation of c^r&B-2 
correlated with absence of estrogen receptors in 10 of 28 series and with ab- 
sence of progesterone receptors, in 6 of 18 series. In most studies, patient age 
did not correlate witit c-erbB-^Z activation, and, in the rest of the reports, c- 
erbB-2 activation was associated with either younger or older ages. 

Correlation of c-erbB-2 Activation With Patient Outcome 

Slamon et al^'^i first showed that amplification of the &«rbB*2 oncogene inde- 
pendently predicts decreased survival of patients with breast carcinoma. The 
correlation of c-erbB-2 amplification with poor outcome was nearly as strong as 
the correlation of number of involved lymph nodes witii poor outcome. Slamon 
et al also reported that oerbB-Z ampUfication is an important prognostic indica* 
tor only in patients with lymph node metastasis.^^^ 

A large number of subsequent studies also attempted to correlate c-er^B-2 
activation with prognosis (Table 5). In 12 series, there was a correlation be- 
tween c-erbB'2 activation and tumor recurrence or decreased survival. In five 
of these series, the predictive value of o-erbB'2 activation was reported to be 
independ^t of other prognostic factor$«-In contrast^ 18 series did notconfinn 
the correlation of c-er6B'2 activation with recurrence or survival. Four possible 
explanations for this controversy are discussed below. 

One problem Is that c-erfeB-2 amplification .correlates with prognosis 
mainly in patients with lymph node metastasis. As summarized in Table 5, most 
studies of patients with axiUary lymph liode metastasis showed a correlation of 
O'BrhB-2 activation with poor outcome. In contrast, most studies of patients 
widiout axillary metastasis have not demonstrated a correlation with patient 
outcome. Table 6 summarizes the studies in which all patients (with and with- 
out axillary metastasis) were considered as one group. There is a trend for 
studies with a higher percentage of metastatic cases to show an association 
between c-erfcB-2 activation and poor outcome. Thus, most of the current 
evidence suggests that c-er&B-2 activation has prognostic value only in patients 
with metastasis to lymph nodes. 
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TABLE 6, COflREUTION OP c-erbB-2 ACTWATrON WrfH OVTCOMB IN PATIENTS 
WITH BREAST CARCINOMA 



Number of Patianu 



Typo of 



AtotBvlasttifo 





AotlvatJon* 


rotaf 


AxniBtf . Ho 
Lymph Noden Wetaelssis 


Stattstleal 
Analysfs'' 


Rieference 


<0.05 


DNA 


176 




M 




<6.05 


DMA 


61 




1 1 


An 

W , 


<0.06 


DNA 


57 




1 1 


cc 

09 


<0.06 


DMA 


41 




1 1 




<0,05 


mRNA 


€2 




II 

u 


TO 


<0.05 










lUl- 


<0.05 


DNA 






M 


01 


<0.0S 


DNA 




ion 


1 1 


17 


<0,05 


DNA 




01 

91 


1 1 


a*v 
87 


<0.05 


DNA 








78. 


<0.05 


Protoln-WB 




350 


M 


85 


<0.05 


Protein 




62 44 


U 


101 


0.05-0.15 


DNA 


67 




U 


111 


0.06-0.15 


Protein 


189 




M 


92 


0.05^0.15 


Protein 




120 


U 


86 


>0.15 


DNA 


130 




U 


113 


>0,15 


DNA 


122 




M 


4 


>0.1S 


DNA 


50 




U 


44 


>0.15 


mRNA 


57 




u 


50 


>0.15 


Protein 


290 




M 


66 


>0.15 


Protein 


195 




U 


11 


>0,15 


Protein 


102 




V 


38 


>0.15 


Protein 




137 


u 


17 


>ai5 


DNA 




1B1 


M 


61 


>0.15 


DNA 




' - 159 


u 


fr 


>0.15 


DNA 




73 


u 


87 


>0.16 


Proteln-WB 




378 


u 


85 


>0.15 


Proteln-WB 




192 


u 


17 


>0,15 


Protein 




141 


u 


66 


>0.15 


Protein 




41 


u 


40 



- 1 no encpomo OT m&Bo stLJdies were lumor nwurrence or decrBSMd «uivlv«t or both. Correlallon between o» 
erbB-z actfvabon and a poorer paiient ouicoma }s datlsUeelV significant ai <0.05. Is ot equivocal eignMcahoe 
at 0.05 to 0.16, and l& nol sienlKcant al >0'.16. 

^Shown B9 variable measure. Lettera 'V/B" fndleets assay by Western blot; the other protein atudies used 
ImmunohtetochBrnicai methods. 

m muHlvariate statistical analysis; U « untvarfala stafistloal analysis. 
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liS^ifu'*^^*^^'^^®^ BREAST CARCINOMAS WITH METASTASIS COMPARED 
WITH PROGNOSTIC 51GNIFICAKCE OF »er&P^2 ACTIVATION 



%oTtufm)rewim 
lymph node 
metastasis In 
eadi study 



70- 



80- 



GO- 



AO ^ 



71 (DNA)» 



61 (DNA)«> 
69 (DNA)<7 



e4(DNA)<" 



4Z(Protein}B 



P<0.05 



0.05<P<0.16 



64(mRNA)» 
61(ONA)4 



68(ONA)« 

57{DNA)"» 

S5(Proteln)» 

48(ProleIn)<' 
46(ProteIn)«" 



P>o:i5 



P for correlation of o-erbB<z aotfvatfan with pattern outoome. 

Blud/8 percentage of breast carcinomas m\ metastasta la oomparetf wHh the cofretefion between o- 
ert)B-2 acUvatlon and outcome. These data Include oAly thosestudlee Ihatconsldered, aa one snwp, aS breaat 
carwer patients, wbeiher or not they had axHtary metastasis. SuperaerlptB m the rafarenoas. In parenttieses 
are the ^es ol c-er6B-2 activaUon. lvalues are mteiprated as In TaUe 3. 



A second problem is that various types of breast caranoma are grouped 
togetiier in many surviyal studies. Because the ciiirent literature suggests that 
c-€r6B-2 activation is infrequent In lobular carcinoma, studies that combine 
infiltrating ductal and lobular carcinomas may dilute the prognostic eflPect of o- 
eriB-2 acUvatlon in du^jtal tumors. In additian^ most studies do not analyzfe 
inflammatory breast carcinoma separately. This condition frequently shows c- 
erbB-2 activation and has a worse prognosis than the usual mammary carci- 
noma, but it is an uncommon lesion. 

A third potential problem is the paucity of studies that attempt to correlate 
c-erfeB-2 activation with clinical outcome in subsets of breast carcinoma without 
metastasis. TWo recent abstracts reported that in patients without lymph node 
metastasis who had various risk factors for recurrence (such as large tumor size 
and absence of estrogen receptors), o-er&B-2 overexpression predicted early 
recurrence, In patients with ductal carcinoma in situ, one smaU study found 
no association between tumor recurrence and c-erfeB-E activation.*^ 

A fourth problem is the laclc of data regarding whether the prognosis 
correlates better with c-erbB-2 DNA araplificaHon or with mRNA or protein 
overproduction. Most studies that find a correlation between c-er6B-2 actlva- 
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tion and poor patient outcome measure c-«riB-2 DNA .amplification (Table S), 
and breast carcinoma patients with greater amplification of c^r&B-2 may have 
poorer survival Recent studies suggest that ampllflcaUon has more prognos- 
tic power than overproduction,i7«3<-» but the clinical significance of c-0rfcB-2 
overproduction without DNA amplificatipn deserves further research. Few 
studies have attempted to correkte patient oute>me with o-e>fcB-2 mRNA 
overproducUon. and many studies of c-erfeB-2 protein overproduction use rela- 
tively less reliable methods such as immunohistochcmical studies with poly- 
clonal antibodies. 

Comparison of G-ertiB-2 Activation With Other Oncogenes In 
Breast Carcinoma 

Other oncogenes that may have prognostic implications in iuman breast cancer 
are reviewed cUcwhere.".i« Hiis section will be restricted to a comparison 
between the clinical relevance of c-er&B-2 and these other oncogenes. 

The c-myc gene is often activated in breast carcinomas, but c-myc activa- 
tion generally has less prognostic importance than c-eriB-2 activation, w,m»t7.87.w 
One study found a correlation between increased mRNAs of c^iiB-2 and c- 
myc, although other reports have not confirmed diis.^*.!* Subsequent research, 
however, could demonstrate a subset of breast carcinomas in which c-myc has 
more prognostic importance than c-cr6B-2, 

The gene c-eriB-1 for the epidermal growth factor receptor (EGFB) is 
homologous with c-erbB-2 but is Infrequently amplified in breast carcinomas « 
Overproduction of EGFR, however, occurs more frequently than amplification 
and may correlate with a poor prognosis. In studies that have examined both c- 
erfeB-2 and EGFR in the same tumor, has a stronger correlation with 

poor prognostic factors. » « Studies have tended to show no correlation between 
amplification of c^rbB-2 and c^er&B-l or overproduction of c-er2yB-2 andEOFR, 
although at the molecular level EGFR mediates phosphorylation of c-eriB-2 
protein.«.w»«.M.i«) Recent reviews describe EGFR in breast carcinoma.«.»« 

The genes c-effeA and ear^l are homologous to the diyroid hormone recep- 
tor, and they are located adjacent to^^bB^ on chromosome 17. These geneii ' 
are frequently coamplified with o^rtB-2 in breast carcinomas. The absence of 
c-erfeA expression in breast carcinomas, however, is evidence against an impor- 
tant role for this gene In breast neoplasia. « Amplification of c-erfeB-2 can occur 
without ear-l amplification, and these tumors have a decreased survival that is 
similar to tumors with both c-erbB-Z and ear-l amplification. Consequently, 
c-eriB-2 amplification seems to be more important than amplification oIoerhA 
or ecr-1. 

Other genes also have been compared with c.^&B-2 activation in breast 
carcinomas. One study found a significant correlation between increased o-erbB- 
2 mRNA and increased mRNAs of/os, platelet-derived growth factor chain A, 
and Ki-roj.Jw Allelic deletion of c-Ha-ra* may indicate a poorer prognosis in 
breast carcinoma.«i but it has not been compared Math c-erfcB-2 activation. Some 
studies have suggested a correlation between advanced stage or recurrence of 
breast carcinoma and activation of any one of several oncogenes. 
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ACrrVATION OF c-erbB-2 IN NON-MAfiflMARY TISSUES 

Incidence of c-erl)B-2 Activation in Non-Mammaiy Tlesues 

Table 7 summarizes the normal tissues in which c-«rbB-2 expression has been 
detected, usually with immuiaohistochemic&l methods using polyclonal antil- 



TABLE 7. PRESENCE OR ABSENCE OF crertB^Z mRNA OR >erbB'l PROTEIN IN 
NORMAL HUMAN TISSUES 



TisBuesWIth 
c-er^B«2 
mRNA 



Tissues Producing Tissues UcKlng Tissues LaeMng 

o-ef5B-2ProMn> . e-MbB-2mRNA . Proton 



Stofnach" 

Jejunum^ 

Oolonw 

Kldney«« 



Uver« 



Lung^ 



Fetal brain» 

Thyroid* 
Uterus^ 



Placenta^ 



EpIdemU&v 
Bdemat root shealh^ 
EoDrtne ewaat gland^ 

Fetal oral mucosa^ 
F^tal esophagus^ 
Slomach22-« 
Felal int©stlns«2» 

Stnall Intestlne^^ 

Fetal kidneys* 

Fetal proximal tubule*^ 
Distal tubute^ 
Fetal collecting duct" 
Fetal renal pelvis'" 
Fetal ureter* 
Hepatocytes^ 
Pancreatic acini=* 
Partcreatic ducls**^ 
Endocrfne cells of telats 

ot Lengerhans^ 
Fetal trachea^. 
Fetal bronc^lo1e8<> 
Bronchioles"^ 



Fetal ganglion cells^ 



Kldnaysio« 



Ovary" 

Bloodvessels^ 



Postnatal oral muoosa*^ 
Postnatal esophagus^ 



Glonjerulus? 

Postnatal Bowman's capsule^ 
Postnatsd proximal tubule*^ 

Postnatal collecting dud^ 
Postnatal rensi pelvis^ 
Postnatal fetal ureter^ 
Uver«w5 



Pancreatic Islets'^ 

Postnatal trachea^ 
Postnatal bronchloleB"^ 

Postnatal alveolP^^ 
Postnatal braln« 
Postnatal gangficn ceUs« 



Endothelium" 

Adrenooorlloal cells^ 
Postnatal thymus^ 
Fibroblasts" 
Smooth muscle cells* 
Cardlao muscle cells^ 



This protein study used Western blots; the rest used Immunohlstochsmlcsl methods, 
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bodies. Only a few studies have been perfonned, and some of these do not 
deraonsbrate convincing ceU membrane reactivity in the published photo- 
graphs. The interpretations in these studies, however, «ro listed, with the 
caveat that these findings should be confirmed by immonopfedpltation or 
Western or RNA blots. Production of c-erbB-2 has been identified in nbiroal 
epithelium of the gastrointestinal tract and sldn. Discrepancies regarding 
.eriB-2 protein in otiier tissues could be due, at least In part, to difierences in 
techniques. 

The data on c-erbB-2 activation in various non-mammary neoplasms 
should be interpreted with caution, because only small numbers of tumors have 
been studied, usually by immunohistochemical methods using polyclonal anti- 
bodies. Studies using cell lines have been excluded, because cell culture can 
induce amplificatioh and overexpression of other genes, although this has. not 
' been documented for c-er2rB-2. " 

AcUvation of c-er6B-2 has been identified in 32 percent (64 of 203) of 
ovarian carcinomas in eight studies (Table 8). One abstract*^ stated that ovarian 
carcinomas contained significantly more c-erfcB-2 protein than ovarian non- 
.epithelial malignancies.. Another report" showed that 12 percent of ovarian 
carcinomas had c-^r&B;'2 overproduction without amplification. 

Activation of c-er&B-2 has been identified in 20 percent (40 of 198) of 
gastric adenocarcinomas in seven studies, including 33 percent (21 of 64) of 



TABLE 8. ACTIVATION IN HUMAN GYNECOLOGIC TUMORS* 









CNenbB'2 






RlRNA 


Protein 






Ovar^ 


Over- 


Tumor Type 


Amplification 


production 


production 


Ovary— carcinoma, not oUvetwlse 


3l/120«1/11,fi' 


23/67* 


23<73,« 


specifiad 


0/S,W0/6,«0/3»^» 






0/2«Q/1«o 






Ovary— serous {papOlajy) carcinoma 


2^,110 i/7^»18 0/672 






lOvary—endomatrfoW 'carcinoma- 


o/anv- "-'-'^ 






Ovary— mucinous carcinoma 








Ovary— clear cell carcinoma 


0/2/«0/1'» 






Ovary— mixed epithelial carcinoma 


0/2» 






Ovary — endometrioid borderOne tumor 


0/1 « 






Ovary— mucinous borderfine tumor 


0/3^e 






Ovary— serous oystadenoma 


0/4W 






Ovary— mucinous cystadenoma 


0«w 






Ovary— sdfifosing stromai tumor 








Ovary— fibTOttiecoma 


on" 






UleruB— endometrial adenocarcinoma 









•Siiown as number o! ca$es with ampJillcaiion [or overpK)ducl!on)rtolal numbor of case$ studied: reference is 
given as supBraoitpL All protein studies used ImmunoWstMhemlcal methods. 
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Intestinal or tubular subtypes and 9 percent {4 of 47) of di£Fuse or signet ring cell 
subtypes (Table 9). AcKvaHon of has been detected in 2 percent (6 of 

281) of colorectal carcinomas, although an additional immunohistocheinical 
study detected c-er&B-2 protein in seven of eight tissues fixed in BouJn's solu- 
tion. One study found greater immunohistochemical reactivity for o-eriB-2 
protein in colonic adenomatous polyps than in the adjacent nomal epithelium, 
using Bouin s fixative. Lesions with anapIasUc features and progression to iiiva- 
si^ carcinoma tended to show decreased immunohistochemical reactivity for c- 
er6B-^2 protein.® Hepatocellular carcinomas (12 of 14 cases) and cholaugiocairei- 
nomas (46 of 63 cases) reacted with antibodies against c-ef*B-2.1n one study* but 
some of these "positive'' cases showed only difiuse cytoplasmic staining, which 



TABLE 9. ACTIVATION IN HUMAN GASTHOINTESTiNAL TUMORS' 



Protein 



Tumor Type 


c-eft»2DNA 


Over- 


ATTtplMcatton 


production 


Esophagus— squanious cell carcinoma 


0/1107 


0/1« 


Stomach— carcinoma, poorly (Ufreremiated 






Stomach— adenooardnoma 












Stomach— cardnoma. Intestinal or tubular type 


5/10'o» 


16/64» 


Stomach— carcinoma, diffuae or signet ring cell type 


0/2ioft 


4/4S» 


Colorectum— carcinoma 


2/49rM 1/45 «» 


1/22»r/822» 




1/45»w.i/45,» ■ 










Colon— vfllous adenoma 


0/tw 




Colon— tulHitovllIous adenoma 






Colon— tubular kdenoma 


0/7» 


iwea» 


Colon—hyperplasllc polyp 


0/1» 




lnte5tlna--lekmtyosarcpma 






Hepatocellular can^ioma — 


-0/12111' 




Hepatoblastoma 


0/15'. 




Cholanglocardnoma 






Pancreas— adenocarcinoma 




2/B0.«»0«»i 


Pancreas-^aolnar carcinonna 






Pancreas — clear cell carcinoma 




0/2<« 


Pancreas— large cell carcinoma 




0/3<i 


Pancreas— signet ring carcinoma 




Q/1« 


Pancreas— chronic Inflammation 







•Shown as number of cases with amplHfcatlon (or overprocjucUon)/lolal number ol cases studied; reTarenoe is 
glvan ae superscript AQ piotaln studies used ImrminoNstochemlcal methods. No stucBee anslyzed for o^/6B< 
2mRNA. — 
Tissues nxed In Bouln'e sohJUon. 

eOnly cases with distinct membrane staining are Interpreted as showing c-ef6B-2 overproduction. 
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TABLE 10, fr€ffeP.2 ACTIVATION iN HUMAN PULMOWARY TUMORS^ 











Tumor Typo 




0^B-2DNA 


PfOt^n 




AmplHlcallon 


Overproduction 


Non-small ob(1 carcinoma 


2/eo« 




1/B4» 


Epldemiold cardnoma 


0/130 


0/10,*' 0^6» 




Adenocarcinoma 


0/21« 


1/13," 0/7..'« 0^7.5' 0/3W 


4/12» 


Large cell carcinoma 


0/9 «0 






Sman cell carcinoma 






0/26," Q/3» 


Carcinoid lumor 


DDK 




.0/3W 



•Shown as mmibw of oases w!th ampllftoation {or overprodudlonVtotal number of cases fitudlaa; reference Is 
ff^w as 6Upei8orf|n. All protein studies used JmmunohtetochemlcaJ meUuxto. No studies analyzed lor c^/^d- 
2 tnRNA» 



does not indicate c-erbB-^ activation to breast neoplasms.* Also, some panwe- 
atic cajcinoinas and chronic pancreatitis tissue had cytoplasmic immuriohisto- 
chemical reactivity for o-crfcB-2 protein, in addition to the rare case of pancre- 
atic adenocarcinoma with distinct cell membrane staining.^ 

Tables 10 through 14 summarize the studies of c-0r&B-2 activation in other 
neoplasms. The oerbB'2 oncogene is not activated in most of these tumors^ 
Activation of oerbB^2 has been detected in 1 percent (4 of 299) of pubnonary 
non-small cell carcinomas in nine studies, although one additional report^ 
found o.eriB-2 protein overproduction in 41 percent (7 of 17). Renal cell carci- 
noma bad o.er6B.2 activation in 7 percent (2 ofSO) in four studies. Overproduo 
tion of c-er&B*2 protein was described in one transitional cell carcinoma of the 
urinary bladder, a grade 2 papillary lesion « Squamous cell carcinoma and basal 
cell carcinoma of the skin may contain c-er^B-2 protein, but It is not dear 



TABLE 11, C-ert)B-2 ACTIVATION IN HUMAN HEMATOLOGIC PROLIFERATIONS* 



*> 

Tumor TVpa 


-*e^ibB-2DNA " 
Amplification 


c^r6B-2 
riiRNA 
' Ov«C 
prbduellon 


o^B-2 
Protein 
Over- 
production 


Hematolopic mallgnanctes 


0/23«'» 






Malignant lymphoma 


0/9^0/3107 


QfV 


0/15« 


Acute leukemia 


0/145^ 






Acute lymphoblastic leukemia 


0/1 w 






Acute myeloblastic leukemia 


0/3W 






Chronic feukerrtia 


0/18" 






Chronic lymphocyilo leukemia 








Chronic myelogenoue leukemia 








Myeloprollfejatlve disorder 


0/1.".. 







■Shown as number of oases wHh ampHncallon (or overproduc(fon)Aoial numlier of cases studfod: refersnoe Is 
given as siiper&cript. All protein studies used Immunohlslochemlca) methods. 
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TABLE 12, <>erfeB-2ACTIVATtON IN HUMAN TUMORS OF SOFT TISSUE AND BONE- 



TUmorTypa 



C^rbB-2 0NA 
Ampllflcation 



Sarcoma 0/10,^^1 OfB^^ 
Matlgnani fibrous histiocytoma OfV^ 

Uposarcoma 0/3w 

Plaomofphfc aarcoma m 

Rhabdomyosarooma Q/V^ 

Osteogenic sarcoma ' (V2,i^0OT 
Chondrosarcoma 

Ewing's sarcoma dfie^ 

Schwannoma o/i^ 



•Shown as number of cases with amplifiomton (oroverpit)du6«on>^al number of cases studied: refemnce Is 
gtven as superscripL No studies analyzed for c^B*2 mRNA or o^frB^ pnitoln. 



whether the protein level is increased over that of normal sldh.sfl Thyroid 
carcinomas and adenomas can have low levels of increased o-erbB-2 mRNA. 
One abstract, described low-level c-erbB-Z DNA amplification In one of ten 
salivary ^and pleomorphic adenomas.* 

Correlatton of c-firbB-2 Activation With Patient OutcomB 

Very few studies have attempted to correlate c<er6B-2 activation in non- 
mammary tumors with outcome.* Slamon et al" showed that c-erbB-2 amplifica- 
tion or overexpression in ovarian carcinomas correlates with decreased survival, 
especially when marked activation is present However, they did not report the 
stage, histological grade, or histological subtype of dicse neoplasms. Another 
study of stages HI and IV ovarian carcinomas found a conrehtion between 
decreased survival and c-erfeB-2 protein overproduction, but not between sur- 
vival and histological grade. One abstract stated that o-erfeB-2 protein ovexpro- 
duction in 10 of 16 pulmonary adenocarcinomas correlated with decreased 
disease-firee inteival.-w Another afestrapXdescribed ifc-tendont^ lor imznunohisto- 



TABLE13. C-ertbP-2 ACTIVATION IN HUMAN TUMORS OF THE URINARY TRACT* 



Tumor Type 


CNort)S*2 DNA 
Ampllflcaiion 


o-erbB-2 
mRNA 
Over* 
production 


c-erbB^2 
Protein . 
Ovar« 
production 


Kidney— renal cell carcinoma 


l/S.^^ 1/4.^0^ 0/5M 


0/16»w 




Wilms' tumor 








Prostate^adenocardnoma 






0/23^ 


Urinary bladder-^carolnoma 






1/48» 



•Shown as number of cases vdlh ampliricallon (or overprodudlonytotal number of cases studied; reference Is 
glvon as aupeiscript. All protein studies uaeO Immunohlatochooiical methods. 
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TABLE 14> MtftoM ACTIVATION IN MISCELUWEOUS HUMAN TUMORS' 



Tumor Type 



Skin— malignant melanoma 
Skin, head and neck — squamous 
ceil carcinoma 

SUb not stated— squamous call 
carcinoma 

Sallvanf giaml— adenocarolnoma 
Parotid gland— adenoid oysUe 
carcinoma 

' Thyroid— anaplastic cardnoma 
Thyroid— penary cardnoma 
Thyroid— ctdenocardnoma 
Thyroid— adenoma 
Neuroblastoma 
Meningioma 



DNA 
Ampliffcatlon 



Oveiptoduotfon 



Protein 
Over- 
produetfon 



0/7*07 



1/1 » 



0/1« 
0/1^ 

0/35,«0/8,«'0/iw * 
0/zw 



0/108B 



3(lowlevete)/5i 
1(lowtevel8)/2^ 



fllvan as supBrsotpt All pmtein studies used trnmuni^lstaehemfoal methods. v 



chemical reacKvity far c^erbB-2 protein to correlate with higher grades of pros- 
tefacadenocaicinoma.«AdAtior)al prognostic studies of ovarian carcinomiw artd 
other neoplasms are needed. 



SUMMARY 

Activation of the c^6B-2 oncogene can occur by amplification of 
DNA and by overproduction ofo-er6B-2mRNAand WfcB-2pix>tein. Appiori- 
matcly 20. percent of breast carcinomas shojvr. !?yidence of ^^&Bi2^vation 
which correlates with a poor prognosis primariiy in patients with metastasis to 
axillary lymph nodes. Studies that have attempted to correlate c.erfeB-2 activa- 
tion with other prognostic factors in breast carcinoma have reported conflicting 
conclusions. The pathologic and clinical significance of c-eriBw2 activation in 
other neoplasms is unclear and should be assessed by additional studies 
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I, J. Christopher Grimaldi, declare and say as follows: 

1 . I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. I joined Genentech in January of 1999. From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time 1 directed or performed 
numerous molecular biology techniques including flualitative Polymerase Chain Reaction (PCR) 
analyses. I am currently involved in, among other projects, the isolation of genes coding for 
membrane associated proteins which can be used as targets for antibody ttierapeutics against 
cancer. In connection with the above-identified patent application, I personally performed or 
directed the semi-quantitative PCR analyses in the assay entitled "Tumor Versus Normal 
Differential Tissue Expression Distribution" which is described in EXAMPLE 18 in the 
specification that were used to identify differences in gene expression between tumor tissue and 
their normal counterparts. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A), 

4. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
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Chromosomal aberrations, such as gene amplification, and chromosomal translocations are 
important markers of specific types of cancer and lead to the aberrant expression of specific 
genes and their encoded polypeptides, including over-expression and under-expression. For 
example, gene amplification is a process in which specific regions of a chromosome are 
duplicated, thus creating multiple copies of certain genes that normally exist as a single copy. 
Gene under-expression can occur when a gene is not transcribed into mRNA. In addition, 
chromosomal translocations occur when two different chromosomes break and are rejoined to 
each other chromosome resulting in a chimeric chromosome which displays a different expression 
pattern relative to the parent chromosomes. Amplification of certain genes such as Her2/Neu 
[Singleton a/., Pathol. Annu. . 27Ptl: 165-190], or chromosomal translocations such as t(5;14), 
[Grimaldi et aL Blood 73(8):2081-2085(1989); Meeker et al. Blood . 76(2) :285-289(l 990)] give 
cancer cells a growth or survival advantage relative to normal cells, and might also provide a 
mechanism of tumor cell resistance to chemotherapy or radiotherapy. When the chromosomal 
aberration results in the aberrant expression of a mRNA and the corresponding gene product (the 
polypeptide), as it does in the aforementioned cases, the gene product is a promising target for 
cancer therapy, for example, by the therapeutic antibody approach. 

5. Comparison of gene expression levels in normal versus diseased tissue has 
important implications both diagnostically and therapeutically. For example, those who work in 
this field are well aware that in the vast majority of cases, when a gene is over-expressed, as 
evidenced by an increased production of mRNA, Ihe gene product or polypeptide will also be 
over-expressed. It is unlikely that one identifies increased mRNA expression without associated 
increased protein expression. This saine principle applies to gene under-expression. When a 
gene is under-expressed, the gene product is also likely to be under-expressed. Stated in another 
way, two cell samples which have differing mRNA concentrations for a specific gene are 
expected to have correspondingly diflferent concentration of protein for that gene. Techniques 
used to detect mRNA, such as Northern Blotting, Differential Display, in situ hybridization, 
quantitative PGR, Taqman, and more recently Microarray technology all rely on the dogma that a 
change in mRNA will represent a sinndlar change in protein. If this dogma did not hold true then 
these techniques would have little value and not be so widely used. The use of mRNA 
quantitation techniques have identified a seemingly endless number of genes which are 
dififerentially expressed in various tissues and these genes have subsequently been shown to have 
correspondingly similar changes in their protein levels. Thus, the detection of increased mRNA 
expression is expected to result in increased polypeptide expression, and the detection of 
decreased mRNA expression is expected to result in decreased polypeptide expression. The 
detection of increased or decreased polypeptide expression can be used for cancer diagnosis and 
treatment. 

6. However, even in the rare case where the protein expression does not correlate 
with the mRNA expression, this still provides significant information usefiil for cancer diagnosis 
and treatment. For example, if over- or under-expression of a gene product does not correlate 
with over- or under-expression of mRNA in certain tumor types but does so in others, then 
identification of both gene expression and protein expression enables more accurate tumor 
classification and hence better determination of suitable therapy. In addition, absence of over- or 
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\inder-expression of the gene product in the presence of a particular over- or under-expression of 
niRNA is crucial information for the practicing clinician. For example, if a gene is over-expressed 
but the corresponding gene product is not significantly over-expressed, the clinician accordingly 
will decide jiot to treat a patient with agents that target that gene product. 

7. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 

By: I I /^"^^ Date: ^1 /^) 

1. Chnstopher Grimaldi 

S:\DOCS\AOIC\aJ!)K-5479.DOC 
071904 f 



DEC 0 8 7005. ^, 



Exhibits 



'^Wi^^ DECLARAT[ONOFPAULPpLAiaS,Ph.D, 
I, Paul Polakis, Ph,D., declare an4 say as follows: , 

1. I was awarded a Ph.p. by the Department of Biochemistry of the Michigan 
Sta^e Umversily m 1984. lyfy scientific Curriculum Vitae is ^ched to and fomis 
part.ofthis Declaration CExhibit A). . 

2. I ain currently employed by Genentech, Inc. where my job title is Staff 
Scientist^ Smce joining Genentech in 1999, one of my primary responsibilities has 
been leadmg Genentechs Tumor Antigen Project, which is a large research project 
witii a primary focus on identifying tumor cell markers that find use as targ^ for 
both the diagnosis and treatment of cancer in humans. 

Tumor Antigen Project, my laboratory has been analyzing 
diflerential expression of various genes in tumor cells relative to normal cells 
Ihe purpose of this research is to identify proteins, that are abundantly expressed 
on certam tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We caU such differraitially expressed 
proteins "tumor antigen proteins", When such a tumor antigen protein is 
Identified, one can produce an antibody that recognizes and binds to that protein 
buch an antibody finds use in the. diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. . 

4. In .the course of the research conducted by Genentech's Tumor Antigen 
Project, we have .employed a variety of scientific techniques for detecting Md 
studymg differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such ' 
technique is the well known and widely used technique of microarray analysis 
whjch has^rovento be extremely usefiil for the identification of mRNA moleeiSles 
that are differentially expressed in one tissue or cell type relative to another In the 
course of our research using microarray analysis. We have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding nornial human cells. To date we 
have generated antibodies that bind to about 30 of the tumor antigen/proteins ' ' 
expressed fi-om these differeritialfy expressed gene transcripts and have Used these 

antibodies to quantitativdy determine the level of production of these tumor 
antigen protems m both human cancer cells and corresponding normal cells We 
have then wmpared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5 From.the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our . 
observations we have found that increases in the level of a particular m]RNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

.6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above aiid my . 
knowledge of the relevant scientific literature, it is my considered scientific 
. ojpinibn that for human genes, an increased level of riiKNA in a tumor cell relative 
. to a normal cell typically correlates to a similar increase in abimdance of the 
encoded protein in the tumor cell relative to the normal pell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels 2tre predictive of 
conresponding increased levels of the. encoded protein. . While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belieif are believed to be true, 
arid fiather that these statements were made with the knowledge that willfiil felse 
statements and the like so made are puipshable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willfiil 
statements may jeopardize the validity of the application or any patent issued 
fliereon. 
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^^cts. If these minor cell proteins dififer among cells to the same extent as the 
more abundant proteins, as is commonly assumed, only a small number of pro- 
•BiD differences (perhaps several hundred) suffice to create very large differences 
Iji ceU morphology and behavior. 

ACell Can Change the Expression of Its Genes 
in Response to External Signals ^ 

j^ost of the specialized cells in a muhicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
pioteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at ail. These examples illi2Strate 
a general feature of cell speciaiizatioiv^-different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each ceU type its permanentiy distmc- 
ilve character. These features reflect the persistent expression of different sets of 
genes. 

Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein ^ 

If differences between the various ceU types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling whwi and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
othenvise processed (RNA processing control), (3) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
troDi (4) selecting wiiich mRNAs in the cytoplasm are translated by ribosomes 
(translational control), (5) selectively destabilizing certain mRNA molecules in 
the cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9^2). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 9-2, only transcrip- 
tional control ensures that no superfluous intermediates are synthesized In the 
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Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules, A cell typicaUy expresses only a 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
poirit of control 



DNA-binding Motifs in Gene 
Regulatory Proteins ^ 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 



Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics ^ 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in proteins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fi*actionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 



Figure 9-3 Double-heUcai structure 
of DNA. The major and minor grooves 
on the outside of the double helix. are 
indicated. The atoms are colored as 
follows: carbon, dark blue; nitrogeJ*' 
light blue; hydrogen, white; oxj^eni 
red; phosphorus, yellow. 
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Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
ciaHked animal cells can maintain their unique character when grown in culture, the 
regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history, Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
mnt to the creation ofspecidUzed cell types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other, this can create a flip-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In eucaryotes gene transcription is generally controlled by combiruitions of gene 
rectory proteins. It is thou^t that each type of cell in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
reflation of many genes. 

In addiUon to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotic cells to regulate gene expressioru In ver- 
tebrates DNA methylatiqn also plays a part, mainly as a device to reinforce decisions 
ttbotttgene expression that are made initially by other mechanisms. 



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant 
fonn of regulation for most genes, other controls can act later in the pathway 
from RNA to protein to modulate the amount of gene product that is made. Al- 
IJiough these posttranscriptionai controls, which operate after RNA polymerase 
nas bound to the gene's promoter and begun RNA synthesis, are less common 
^^transcriptional control, foi many genes they are crucial. It seems that every 
step in gene expression that could be controDed in principle is likely to be regu- 
ated under some circumstances for some genes. 

We consider the varieties of posttranscriptionai regulation in temporal or- 
er, according to the sequence of events that might be experienced !?y an RNA 
^lecule after its transcription has begun (Figure 9-72). 
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Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A Is 
transcribed and translated much more 
efTtcientl/ than gene B.This allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
Uanslated with a different efficiencyi allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds ffigure 6-4). It differs from 
DNA chemicaDy in two' respects: (1) the nucleotides in RNA are 
ribonucleotides— ihsit is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like X can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U), It is not imcommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matical^ in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex tiiree-dimensional shapes allows some RNA molecules to have structural 
and catalytic frmctions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

AH of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 



302 



Chapter 6 : HOW CELLS READ THE GENOME: FROM DNA TO PROTEIN 




Figure 6-89 Protein aggregates that cause human disease. (A) Schematic iHustratlon of the type of 
conformational change In a protein that produces material for a cross-beta filament (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that Is central to prion diseases. PrP is high!/ unusual 
because the misfolded version of the protein, called PrP*, induces the nbrmal PrP protein it contacts* to 
change Cts conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggre^tion, but because this structure is 
not infectious in this way. it cannot spread from one aninruJ to another. (C) Drawing of a cross-beta filament 
a common type of protease-resistant protein aggre^te found in a variety of human neurological diseases. 
Because the li/drogen-bond interactions in a P sheet form between polypeptide bacid>one atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce ^Is structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the lllcely change of two (x-heiices into four 
^sQwds. Althou^ the structure of the normal protein has been determined accurately, ^e structure of the 
infectious form Is not yet laiown v/ith certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpell. adapted from M. Sunde et al.tj.Moll fifot 273:729-739, 
1 997; D. adapted from S.B. Prusiner. Trends Bfoc/tem. Sd. 21 :482-487, i 996.) 

animals and htimans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormd form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious 'protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure &-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with v\Aiich each of the many steps 
is performed. 

We discuss in Chapter 7 that ceils have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of tiie steps in Fig- 



FROM RNATO PROTEIN 



363 




lAAAA mRNA 
EXPOFTT poIy-Atail 



NUCLEUS 



CVTOSOL 



t^tKfKPJK mRNA 



Y 

t IN 



mRNA DEGRADATION 



INITIATION OF PROTEIN SYNTHESIS 
AAAA 

COMPLETION OF PROTEIN SYNTHESIS 
AND PROTEIN FOLDING 

N 



'COOH 

\r-*^ PROTEIN DEGRADATION 



0(^C( 



Figure 6^0 The production of a 
protein by a eucaryotic celLThe final 
level of each protein In a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 
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ure 6-90) could be regulated by the cell for each individual protelnu However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first-step— the transcription of Its DNA sequence into an RNA 
molecule. 



Summary 

77ie tmrtslation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large rihonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognize, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mBNA (codons). The sequence ofnucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the Hbosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anUcodon, Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of'&tree sequential 
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Figure 7^ Six steps at which 
eucaiyotic gene expr^sion can be 
controlled. Controls that operate at 
steps f through 5 are discussed ip this 
chapter. Step 6, the reflation of protein 
activity; Includes reversible acdvatton or 
inactlvatlon by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible mactivation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells egress, at what level is the control of gene ejqpressipn 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed [transcriptional control}, (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control}, (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and determming where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5) . 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the foUowing sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
eTcpression. 



Summary 

The genome of a cell contains in its Dm sequence the information to make many 
thousands of different protein and RNA molecules. A celi typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment^ sudh as signals 
from other cells. Although aU of the steps involved in expressinga gene can in prin- 
ciple be regulated, for most genes the initiation of RNA tmnscription is the most 
. important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, flie transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
a single signal. Many others are complex and act as tiny microprocessors, 
■xe^onding to a variety of signals that they interpret and integrate to switch the 
^neighboring gene on or off. Whether complex or simple, these switching devices 

X.' 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters oif genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
immethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired Such regions are preserved in modem day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character throu^ many cell 
division cycles and even when grown in culture, the gene r^ukOoiy mechanisms 
involved in creating them must be stable once established and heritable when tfte 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusuaUy accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive tnterdc- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other, this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative Jeedbatk loops ivtOi programmed delays form the 
basisfor cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce dedsiorts about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also urtderlies the phenomerwn of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inheritedflrom the mother or the father. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes, A black line marks the location 
of a CG dinudeotide in the DNA 
sequence, while a red "lollipop" Indicates 
the presence of a meth/l group on die 
CG dinudeotide. CG sequences that lie in 
regulatory sequences of genes that are 
transcribed in germ cells are unmeth/lated 
and therefore tend to be retained in 
evolution. Meth/fated CG sequences, on 
the other hand, tend to be lost through 
deamination of S-methyl C toT, unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscrlptionai 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control^ for 
niany genes they are crucial. 



POSTTRANSCRIPTIOISIAL CONTROLS 



435 



CHAPTER 29 



Regulation of transcription 



-flie plienot>-pic (HfTemices ihat dislinguish the 
yflrious kinds of cells in a higher euknryole are 
largely due lo (hTTerenres in ihe expression of 
genes thai code for proieins. thni is. those tran- 
5cHbed by RXA polymerase fl. hi principle, ihe 
expression of ihese ^r^'nes niiphl he rejiulaied at 
floy one ol' several singes. Tfie nmcepi of the 
-level or conlror implies ihai gene expression 
* is nol necessarily an automatic process once it 
has begun. It could he regiilaied in a gene- 
specillc way ai any one of several sequential 
steps- We can distinguish (at least) five poten- 
tial control points, fomiing the series: 

Activation of gene striu-iiire 
I 

Inltiatton of transcription 
I 

Processing Ihc iraitscript 
i 

Transport, lo cvtoplasni 
i 

Translathm of mHNA 

Tl»e e.vislmce of the lirsi step Is Implied by 
the discover}- thai genes may exist in either oV 
two structural conditions. Helative to ihe slate 
f»r most of the genome, genes are found hi 
iiii '"aclive" stale In the cells in which thev 
are expressed (see Chnpier 27). The change of 
slruciiire is distinct from the act of transcrip- 
lion, and indicates that the gene is *'transcrib- 
abie.'' This suggests that acquisition of ihe 
-active" structure must be Ihe lirst slep In gene 
expression. 

Transcription of a gene in the active stale is 



controlled at the stage of initiation, that is. hv 
the inleraclion of R.NA polymerase with its pro- 
moter. This is now becoming susceptible lo 
analysis in the in viltv systems (see Chapter 
For most genes, this is a major control 
point: prol)ably ii is the most common level of 
regulation. 

There is at present no evidence for coniro) 
at subsequent stages of transcription in eukar)*- 
olic cells, for example, via antilerminalion 
mechanisms. 

The primaiT transcript is modified by capping 
at the 5' end, and usually al.<;o by polyadenyla- 
lion at the 3' end. Inirons must be spliced out 
from the iran.scripts of interrupted genes. The 
mature a\'A must l)e exported IVom ihe nudeus 
to llie cytoplasm. Regulation of gene expression 
by seleciion of sectuences al Ihe levef of nuclear 
RNA might involve any or all or these stages, 
bill the one for which we have most evidence 
concerns changes in splicing: some genes are 
expressed by means of alternative splicing pal- 
terns who.se reguintion controls Ihe type of pro- 
tein product (see Chapter 'M>). 

Finally. Ihe translation of an mHNA in ihe cylo- 
pla.sm can be specirtcally controlled. There is Htlle 
evidence for the employment of this mechanism in 
adult somatic cells, but it does occur in some 
embnonic situations, as described In Chapter 7. 
The mechanism is presumed lo involve the block- 
ing of initiation of translation of some mRNAs by 
specific protein factors, 

But having acknowledged thai control of gene 
expression can occur al multiple stages, and 
that production of RNA cannot inevitably be 
equated with production of protein, it is clear 
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thai the ovenvhelming majority of regulatory 
events occur at^the^ initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at thie heart of eukar>'0lic differentiation; 
indeed, Nve see examples in Chapter 38 in 
which proteins thai regulate embr>'onic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to proide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identlRes 
the common target genes to the IranscripUon 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or exirinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they' share a pivmoter element that is recogni^td 
by a regulator;)' transcription factor. \n element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
HSE (heal shock response element), ORE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in. Table 29.1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for* a short distance on either side of 



Table 29.1 Inducible transcripiion factors bind to 
response elements that identity groups oJ promoters 
or enhancers subject to coordmate control. 



Regulatory Agent Module Conser>su5 



Factor 



Heal shock HSE CNNGAANNTCCNNG HSTF 

Glucocortfcoid GRE TGGTACAAATGTTCT Receptor 

Plwrtjol ester TRE TGACTCA API 

Sefum SRE CCATAITAGG SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoint, but are usually <200 bp upsu^am 
of it. The presence of a single element usually 
is suCTiciem to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in 9^ 
moters or in enhancers. Some types of elements 
are typically found in one rather than the ollW^ 
usually an HSE is found in a promoter, ^hilc > 
GRE Is round in an enhancer. We assume 
all response elements function by the san»f 
general principle. A gene is regulated b?' ^ 
sequence at tlie promoter or enhancer that ^> 
recognized by a specific protein. The P^^Jl 
Junctions as a transtription factor needed J*^ 
RNA polymerase to initiate. Active protein ^ 
available only under conditions when the^ ^ 
to be expressed; its absence means that the ^ 
moter is not activated by this particular 
An example of a situation in which ' 



genes are controlled by a single factor 



is P^*' 



vided by the heat shock response. This ts ^^^^ 
mon to a wide range of prokaryoi^^ 
eukar>'otes and Involves multiple ^^"^^Ijnifi* 
gene expression: an increase in tcmp^^ 
turns ofT transcription of some geneSi 
transcription of the heat shock 6*"^«|^^- 
causes changes in the translation of ''V^if* 
The control of the heat shock genes >J|^^ ^^fd 
the differences between prokaryotic 
eukaryoiic modes of control. In- bactcria» ^ 
sigma factor is synthesized that 
polymerase hoioenzyme to recognize an. 
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Abstract 

Background: Prostate stem cell antigen (PSCA) Is a recently defined homologue of the Thy.I/Ly-6 femily of 
giycosylphosphatidyllnositol (GPI)-anchored cell surface antigens. The purpose of the present study was to 
examine the expresston status of PSCA protein and mRNA In clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treaonent of Pea. 

Materials and Methods: Immunohlstochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia 
(BPH), 20 prostatic Intraepithelial neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
Independent prosute cancers. The level of PSCA expression was semiquantitatlvely scored by assessing both the 
percentage and Intensity of PSCA-positive staining cells in the specimens. Then compared PSCA expression 
between BPH, PIN and Pea tissues and analysed the correlations of PSCA expression level with pathological grade, 
clinical stage and progression to androgen-independence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense 
and uniform than that seen in HGPIN and Pea. There were moderate to strong PSCA protein and mRNA 
expression in 8 of I i (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN (112%) samples (p < 0,05, respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < 0.05» respectively). In addition. IHC and ISH staining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and androgen-independence, and 
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and nuiy be a promising molecular target for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer inaeasing in China. Despite recently 
great progress In the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers that can accurately discriminate between 
indolent and aggressive variants of Pea. Hicre also contin- 
ues to be a need for the identification and charaaerization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lack of spedfic tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate spedfic antigen, prostatic add 
phosphatase, glandular kallikrein 2], the majority of these 
are seaeted proteins not ideally suited for many immuno- 
logical strategies. So^ the identification of new cell surface 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea. 

Reiter RE et al |1] reported the identification of prostate 
stem cell antigen (PSCA), a cell surface antigen that is pre- 
dominantly prostate specific. The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs 
to a member of the Thy-l/Ly-6 family and is anchored by 
a glycosylphosphatidylinositol (GPI) linkage. mRNA in 
situ hybridization (I5H) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemistry (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin^embedded tissue sped- 
mens of benign prostatic hyperplasia (BPH, n » 20), pros- 
tate intraepithelial neoplasm (PIN, n « 20) and prostate 
cancer (Pea, n = 48). Furthermore, we evaluated the possi- 
ble correlation of PSCA expression level with Pea tumori> 
genesis, grade, stage and progression to androgen- 
independence. 

Materials and methods 
Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prostate- 
ctomy, transurethral resection of prostate (TURP) or biop- 
sies. The patienu were dassified as 20 cases of BPH, 20 
cases of PIN, 40 cases of primary Pea, including 9 patients 



with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight spedmens were harvested from diese andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut Into two parts, one 
was fbced in 10% formalin for IHC and the other ueated 
with 4% paraformaldehyde/0.1 M PBS PH 7.4 in 0.1% 
DEPC for 1 h for ISH analysis, and then embedded in par- 
affin. All paraffin blocks examined were then cut into 5 
^m seaions and mounted on the glass slides specific for 
IHC and ISH respectively in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Gleason score by the experienced urological pathologist at 
our institution based on the criteria of Gleason score (2). 
The Gleason sums are summarized in Table 1. Clinical 
staging was performed according to Jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens Into two groups, i.e. low 
grade PIN (grade I - II) and high grade PIN (HGPIN, 
grade III) on the basis of literatures |3,4]. 

Immunohistochemical (IHC) analysis 
Briefly, tissue sections were deparaffinized, dehydrated, 
and subjeaed to mierowaving in 10 mmol/L dtrate 
buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval. Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1 : 100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially adding biotinylated secondary 
antibodies and strepavldin-peroxidase, and localized 
using 3,3*-diaminobenzidine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hytriiiizatlon (ISH) 
Five-nm-thick Ussue seaions were deparaffinized and 
dehydrated, then digested in pepsin solution (4 mg/ml in 
3% dtric acid) for 20 min at 37.5 "C, and fUrther proc- 
essed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48'C 
overnight. The posthybridization wash with a high strin- 
gency was performed sequentially at 37 * C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 » SSC for 15 min 
and in 0.2 x SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anii-dlgoxigenin antibody at 
37. 5*0 fori h followed by washing in I k PBS for 20 min 
al room temperature, and then to strepavidin-peroxidase 
at 37.5 *C for 20 min followed by washing in 1 x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Table I: Correlation of PSCA eacpresslon with Gleason score 



Intensity x frequency 



Gleason score 



2-4 

5-7 

8-10 



5(83) 
19(79) 
S(28) 



1(17) 
5(21) 
13(72) 



Table 2: Correlation of PSCA expression with clinical stage 



Intensity x frequency 



Tumor stage 



0-6 (X) 



9(%) 



27 (67.5) 



13(32.5) 
6(75) 



terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative conuoL 

Scoring methods 

To determine the correlarion between the results of PSCA 
immunostaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by IHC and PSCA mRNA staining 
by ISH. Each slide was read and scored by two independ- 
ently experienced urological pathologists using Olympus 
BX-41 light microscopes. The evaluation was done in a 
blinded fashion. For each section, five areas of similar 
grade were analyzed semiquantitatively for the fiaaion of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intiaobserver concordance. There was greater 
than 95% intra- and Interobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed (0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 = posi- 
tive staining in <25% of the sample; 2 » posidve staining 
in 25%-50% of the sample; 3 = positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
(1,5]. In this way, we were able to differentiate specimens 
that may have had focal areas of inaeased staining from 
those that had diffuse areas of increased staining [6). The 
overall score for eadi specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expres- 
sion in BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's i-test. Univariate associations 
between PSCA expression and Cleason score, clinical 
stage and progression to androgen- independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6], 
but PSCA protein expression was weak (composite score 
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2) in one and negative (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in all cases 
examined 

PSCA expression In PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA scores) in 7 of 9 (77.8%) 
low grade PIN and in 2 of 11 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of II (45.5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC, Strong PSCA pro- 
tein and mRNA expression (9 score) were detected in the 
remaining 3 of 11 (27.3%) HGPIN. There was a statisti- 
cally significant difference of PSCA protein and mRNA 
expression levels obsetved between HGPIN and BPH (p < 
0.05)« but no statistical diiference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression In Pea 

In order to determine if PSCA protein and mRNA can be 
detected in prostate cancers and If PSCA expression levels 
are increased in malignant compared with benign glands, 
Forty>eight paraffin-embedded Pea specimens were ana* 
lysed by IHC and ISH. It was shown that 19 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure I). In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall, Pea 
expressed a significantly higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respeaively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea. 

Correlation of PSCA expression with Gleason score in Pea 

Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA with Gleason grade of 
Pea, as shown in Table 1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores « well-differentia- 
tion, 5-7 scores = moderate-differentiation and 8-10 
scores = poor-differentiation [7]. Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating that 
poorly differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted In 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 
and uniform staining. Itie results indicate that PSCA 
expression increases significantly with higher tumor grade 
in human Pea. 

CorrelaUon of PSCA expression with clinical stage In Pea 
With regards to PSCA expression in every stage of Pea, we 
showed the results in Table 2. Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed statistically high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen^ 
independent progression of Pea 

All 9 Specimens of androgen-independent prostate can- 
cers stained positive for PSCA protein and mRNA, Eight 
specimens were obtained from patients managed prior to 
androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score « 9), compared with 
three out of eight (37.5%) of patients with androgen- 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significantly with progres- 
sion to androgen-independenee of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significandy with Increasing 
grade, worsening stage and progression to androgen-inde- 
pendenee. 

Correla^on of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
Positive staining areas and its intensity and density scores 
evaluated by IHC were identical to those seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importantly, 27/27 samples widi 
PSCA mRNA composite scores of 0-2, 32/3(S samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 there were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect posttranscriptional 
modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that the results of PSCA immunosuining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Figure t 

Representatives of PSCA IHC and ISH staining in Pea (A. IHC stoinlng, B. ISH staining, x200 magnification). A|, B|: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (B|) showed no back- 
ground staining. Aj, Bj: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score = 
6) in all malignant cells; Aj: IHC show$ not only cell surface but also apparent cytoplasmic staining of PSCA protein. Aj, Bj: a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) in all malignant cells. 
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Discussion 

PSCA Is homologous to a group of cell surface proteins 
that mark the earliest phase of hematopoietic develop- 
nnent. ?SCK mRNA expression is prostate-spedfic in nor- 
mal male tissues and is highly up-regulated in both 
androgen-dependent and-independent Pea xenografts 
(LAPC-4 tumors)< We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
low grade PIN tissues. However, PSCA protein and mRNA 
are widely expressed in HGPIN, the putative precursor of 
Invasive Pea, suggesting that up-regulation of PSCA is an 
eaiiy event in prostate carcinogenesis. Recently, Reiter RE 
etal [1], using ISH analysis, reported that 97 of 118 (82%) 
HGPIN specimens stained strongly positive for PSCA 
mRNA. A very similar finding was seen on mouse PSCA 
(mPSCA) expression in mouse HGPIN dssues by Tran C. 
P ct al (8). These data suggest that PSCA may be a new 
maricer associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA are highly 
expressed in a la^e percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 
independent and metastatic cases. Fluorescence-activated 
cell sorting and confocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in 
Pea cells (9]. Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1). One pos- 
sible explanation for this is that anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. /Use, it is possible that the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane [5]. These data seem to indicate that PSCA 
is a novel cell sur&ce marker for human Pea. 

Our results show thai elevated level of PSCA expression 
correlates with high grade (i.e. poor differentiation), 
inaeased tumor stage and progression to androgen-inde- 
pendence of Pea. These findings support the original IHC 
analyses by Gu Z et al (9 ], who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al 
[10], in which the significant association between high 
PSCA expression and adverse prognostic features such as 
high Gleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. It Is suggested that PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H et al 
used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa 
patients, each PGR indicated the prognostic value in the 
hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic 
cases mih positive PSCA PGR Indicated lower disease-pro- 
gression-free survival than those with negative PSCA PGR, 
demonstrating that PSCA can be used as a prognostic fac- 
tor. Dubey P et al 112) reported that elevated numbers of 
PSCA + cells correlate positively with the onset and deveU 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN rnodels com- 
pared with its normal prostates. Taken together with our 
present findings, In which PSCA Is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
possible to use increased PSCA expression level or 
inaeased numbers of PSCA-positive cells in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or dinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplifieation. In humans, PSCA is located on chro- 
mosome 8q24.2 fi], which is often amplified in meta- 
static and recurrent Pea and considered to indicate a poor 
prognosis J 13-15]. Interestingly, PSCA is in close proxim- 
ity to the c*myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers [16,17]. Reiter 
R£ et al 1 18 1 reponed that PSCA and MYC gene copy num- 
bers were eo-amplified in 25% of tumors (five out of 
twenty), demonstrating thac PSCA overexpression is asso- 
ciated with PSCA and MYC coamplification in Pea. Gu Z 
et al (9] recently reportcted that in 102 specimens availa- 
ble to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA e3q>res- 
sion. Taken together with our findings. In which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present, the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors (18]. WatabeTet al [19| reported that transaiptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contaci and pro- 
tein kinase C (PKC) (20). Homologues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signalling through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21]. Thy-1 is involved in T cell activation and transducts 
signals through src-lLke tyrosine kinases [22]. Ly-6 genes 
have been implicated both in tumorigenesis and in cell- 
cell adhesion [23-25]. Cdl-cell or cell-matrix interaction is 
critical for local tumor grovnh and spread to distal sites. 
From its restncted expression in basal cells of normal 
prostate and its homology to SCA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i.e. 
anti-apoptosis) and/or proliferation [Ij. Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through affecting cell transfonnation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could foe used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the detection of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dia metastatic progression better than current assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HCPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen-independent 
progression. PSCA protein overexpression is due to the 
upregulation of its mRNA transcription. The results sug- 
gest that PSCA may be a promising molecular marker for 
the dinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor. 
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Abstract 

'tanshtron biHIatiott id regulated in resiKmse to 
mitrfent nvan^ybllHy and mttogenio stiimif atkm and Is 
coupled wfth cycle pnigrea^on and cell gmwth. 
Seveitf aKensrtiond in tran^ationtf Gontral occu^ 
cancer. Variant mRNA eeciuenees G«i alter the 
tanstofidiial emcieflGy of IndMdiid 
wMch In turn irtay a role In cancer Irfolcgy. Changea In 
the exprosskm or aviOablliiy of ccMnponentB of ttie 
tranelstiomtf machine and En the activation of 
translation through signsd transduction pathways can 
lead to more glottal dttutges. audi as an Inctease bi 
the overAl rate of protein synthesis and translatiomd 
acflvalkm of the mnM molecules Ihv^v^ 
growth md profif eratlon. We revfew the tiasic 
principles of translational control, the alterations 
encountered In ^oicw, and selected thersfto 
targeting tran^atlon Initiation to help elucidate new 
therapeutio avMiies. 

introduction 

The ftindamental principle of molecular therapeutics in can- 
cer Is to exploit the differences in gene expression b^een 
cancer cells and nonral cdls. VMth the advent of cDNAan^ 
technology, most efforts have concenbiated on Identifying 
cflfferencesln gene expression at the level of mRNA, which 
can be attributable either to DMA ampUflcaflon or to differ- 
ences In transcription. Gene expression Is qxjSlte complicated, 
however, and Is also regulated at the level of mRNA stabHiV. 
mRNA translation, and protein stabHIty. • 

The power of translational regulation has been best reoog- 
nfized among developmental blologtet^ because transcrl^on 
does not occur In eaV enibiyogenesis In eukaryotes. Fdr ex- 
ample, In Xem^, the period of transcriptional quiescence 
continues unffl fl» embryo reaches nOdbiastuIa tnan^don, the 
4000^ staga Therefore, att necessary mRNA molecules are 
transcribed during oogenesis and stockpiled in a tnanSlatidnal V 
Inacth^e, masked fbnm. The mRNA are trenslatlonaB/ activated 
at appropriate tinries during oo(^ maturatton, fertiilzB^ 



eatVjBnfib^^ ml thus, are under strict translational 
control. 

Translation has an established role In cell ipowth. Baal- 
c^ly, an Increase In proton synthesis oocure as a conse- 
quence of mitogenesls. Until recently, however, little was 
known about the alteretlbns In mfU^ transit 
and mudi is yet to be discovered about their n^e In ^ 
development and progression of cancer. Here we raylew the 
basic princ^es of transiatk>na] control, the atterstlons m-- 
oountfiTBd In cancer, and seleciBd 1henE«>le3 targetir^ tR^ 
tionMtiatk>n to elucklate potential new therapeutioa 

Basle Principles of Translation^ Control 
MechanlamofTmslalionlnfOoUon 
l)aisla!k)n intlialkin is the mal^ 
Ttanslatkm Mtiattoii Isaosn^proce^lnwhl^ 
tRNAandthe406and60S rax>soma{8ubunlt8arerecniftedto 
the 5' end of a mlRNA molecule otkI assembled t)y eukaryotic 
tran^atkm irtftlalkm fectore Into an SOS rib^ 
codon of ttie mWA (Fig. 1). TYie 5' end of eukaryxA mRNA Is 
capped, £a« conlafris the c^ stnjcture m^GpppN (T-in^hyl- 
guanoslifie-triphospho-5'Hibbnucleoski^ Most tiaisiatiOn In 
etdcaryotes occurs In a capHtependent fat^^ 
spedficafly reoogrted fay the etF41&^ whteh bihds the 5' cap. 
The elF4F translation Inlliation complex Is then fb^^ 
assembly of eiF4E, the RNA heOcase'elFtt, and elF4Q, a 
scaffbiding protein that rnecflates the bind^ of the 4^ 
somal subunit to the mRNA mdecule through Interactkm wi^ 
the eiF3 protein present on the 40S ribosoma eiF4Aand eiF4B 
partte^e Ifi meHhg the secondary sinjoiure of the 5' UTR of 
the mRNA. The 43S Initiation complex (40S/elF2/M^-tRNV 
GirP comple}0 scans the mRNA in a 6'->3' (firectkm unii it 
encounters an AUQ start oodon. This start oodon Is then base* 
paired to the andoodon of Initiator tRNA, fomilng tiie 48S InHI- 
atkm complex. The biitlatlonliactorB are then displaced ft^ 
4dS complex, and the 60S ribosome Joins to tbrni the BOS 
rilx)80in& 

UnHka most eukaryofc translation, trar^atlon tnitlafion of 
certain mRNAs, such as the pkx}maA/lnjs RNA, Is cap inde- 
pendent and occure ty fritemal rftx>some entiy. TWs mecha- 

riism does not raqi^elF4E EHher the 43Soomito can bind 
the Inltiatfon oodon directly through'ktteractfon the IRES in 

the 5' LJm such as in the encephstonryocardftis vims^ or it ca^ 
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Wtlattefi In dukaiyotes. TN 4E-BP8 era hyperphos^ 

22.^^SS^J5i*!li£JS"^ W elF3. and the temaiy complex 
^^S^i^BUfUS^^^^'^ una an AUQ start oodon 

raieasedi and me large ribosomal siri>unR to looni^ 



InWafly attach to the IRES and then 

scamiing or transfer, as Is the case vkrfth the pdlov^ 

Regulation of Translatfoii Initiation 
Translation Inftfatlon Can be regulated by alterations In the 
expression or phosphorylation status of the various ftutos 
Ihvpfved. Key components In translatlonal regulation that 
may provide potential therapeutic targets Ibllow. 

ell^ eIRE playa a central role In translation regulation. 
It Is the least abundant of the Initiation factors and Is con- 
tidered the rate-llmltlr^ component for Initiation of cap- 
dependent translation. elF4E may also be Involved In mi^ 
spHdng, niRNA 3^ (vocessing, and mRNA nudtecyloplas- 
mic transport elF4E expression can be Increased at the 
transcrtptfonal level In rasponse to seium or growth Itetors 
(3). elF4E overexpresslon nnay cause preferential translation 
of mRNAs containing exces^e secondary structure m theft- 
s' UTB that aro rwnfnalV discrlmlnatod against 1^ the tran^ 



latlonal machinery and thus are &iefflclentiy translated (4«-7>. 
Asexamf^esoftiys, overex|M^»sIon of elF4E promotes la- 
creased translaUdn of vascular endotti^Ial growth fector 
ftolxJblast growth liBctor-2, and cyclln Dl 

Another mechanism of control is the regtjiatlon of elF4E 
pho^or^atioa eli% phosphorylation Is mediated by the 
nrtltogen-acttvated protein Idraee-interacting 1^^ 
Is activated by the mttogen-activcded palhw^ motivating 
Qxtrecellular s^nal-rtfated idnases stkI the strass-ectivatacl 
patiiway acting through p3B nrdtogeivacUvated protein Id- 
nase (10-^13). Several mitogenst such as seiump fidatelet- 
derived growth factor* ^jldenmal growth fector, teulfr^, 
Lotensin II, src kinase overexpresslon, and ras over- 
expression, lead to eIF4E phosphorylation (1^ The phos- 
phorylation status of elF4E Is usually conflated v\rttii the 
translatiorial rate and growth status of tlie ceil; howevBr? 
e{F4E phosphorylation has ato been observed bi response 
to some cellular stresses when translational rates actually 
decrease (1^ Thus, lUrtiier study is needed to understaRd 
ttie effects of eIF4E phosphorylalfcm on elF4E actiNrfty. 

Another mechanism off regutertion Is the alteration of elF4e 
availabflHy by tite binding of eIF4E to ttie eiF4&bindIng pro- 
tdns (4E-BP, also l<nown as PHAS^ 4E-BPs compete with 
elF4Q fbr a bimfttg site In elR£ The binding of elF4E to the 
best characterized e}F4&bindIng protein, 4E-BP1, b regih- 
latedby4E^1 plK>s|Aoiylitfioa i4yp(Y)ho^>horylated 4E- 
Bl»1 binds to eiF4^v^iereas4E-8l>1 hyperr^io^^iorylation 
decreases this tiindlng. Insulin, angiotensin, hernial 
gnwrth factor, plateiet-derived 9^owtti factor, hepatocyte 
growtii factor, nerve growtiifEntor, lnsulln-tikB£powtti factors 
I and li,*lnterteuldn 3, ^anulocyte-macrophage ooioriy-ettrn- 
ulating factor + steel factor, gastrin, and the adenovbiis have 
all been reported to induce phosphorylatton of 4E-BP1 and 
to decrease tiie ability of 4E-BP1 to bind elF4E (15, 
Conversely, d^arlvatfon of nutrients or growth factors results 
In 4&BP1 dephosphorylation, an hcrease in eiF4E binding, 
and a decrease In cap-dependent transiatioa 

p70S6 Kinase. Phosphorylation of ribosomal 408 protein 
Sd by S6K is thought to pii^ an important roielntransiatfonat 
regulation. S©K mouse einbryonic cells proBferate morQ 
slowly than do parental ceils, demonstrating tiiat S6K has a 
positive Infhjence on ceH proiifertttion (17). S6K regulates ttie 
translation of a group of mRNAs possessfrig a 5' terminal 
oilgopyrlnMnetract 05'TDP)found attt» 5' umof rft^^ 
protein mRNAs and other mRNAs coding fw components of 
«» translational machinery. PhosphorylaMon <rf S6K Is regu- 
lated in part based on the avaSabPlty of nutrients (18, 19)andis 
stimulated several growtii fiactws, suc^ as pta^ 
growtii factor and Insulin^ growtti factor I {20)^ 

eiF2a Pho^>horylation. The binding of tiie Initiator «^ 
to tiie small ribosomal unit Is mediated by translation Inltia- 
tfon factor elF2. Pho^>horyfation of the (»^buhlt of eii=2 
prwwrtsfbrni^^ 

inMbits global protein symhesis (21, 2^ eR=2a is i^iosi^ 
rylated undor a variety of conditions, such as vtrel Infection, 
nutrient deprivation, heme deprivation, and apoptosis (a^! 
elF2a Is phosphoryf ated by heme-ragulated inhlbHor, nutrient* 
regulated protefri Idnase, and the IFN^nduced, double- 
atoanded RNA-acUvated protein Idnase (PKR; Ref. 23). 
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Tlie mTW SgnaOng Pattiwqy. The macr^ 
rapamydn {^ra&nus; Wyeth-Ayerrt Research, CoDegevflle* 
RA) fm been the mjbject of intensive study because It In- 
hlhrn signal tmnsducUon pathways fnvoived In T-oeO activa- 
tfon. Be rapamydn-sensftive oomponwit of these pathways 
IsmTOR {also caned FRAP or RAFT!). mTOR Is the mam- 
malian homoiogue of the yeast TOR proteto that regulate 
ptigresslon and translation In response to nutria ovallabir- 
Ity mTOR Is a serine-threonine kinase that modulates 
translation lrmiatk)n by aHerfng the phosphoiyiallon status 
4E-BPi and S8K (FJg. 2; Ref. 2^. 

4&BP1 IsphosplKaiyl^onmuRiplerBsfcJiis&mTORpho^ 
phoiylates the Thr-37 and Thr^ residues of 4&BP1 Ai v*D 
@69; however, phosphoryl^on at these sHes is not asso(^^ 
wHh alossof elR£tMlng.Ph08phoiylatfonofllv^ and 
lhr-46 is reqiAiad for subsequent ph06|4iorylatlon at sevend 
COOHrtennlna), serum-senalHto sites; a oonMnatlon of these 
phosphofylation events appears to be needed to lifihlbit the 
• blndlr»gof4eBP1lDdF4E(26),Thepfoduclofthei47Mgene,. 
P38/MSK1 pathway, and protein Kinase Ccr also play a rob h 
4&BP1 phosphoryiatfon ^7-2^. 

S6K and 4E-BP1 ara also regulated, In part, by PI3K and Ite 
downstream protein kinase Akt PTBM Is a phosphatase that 
negatively regulates PI3K signaling. FTEM nuO cells have 
oonstltutfvely active of Akt, with Increased S6K actMty and 
S6 pho^phoryfatkm (3(9. 88K activity is Inhibited both by 
PI3K Inhibitors v«)rtmannln and LY2M002 and by mTOR 
InWbltor rapamydn ^4). Akl phosphorylates Ser-244a In 
mTOR in vsttto, and this site Is pho^horylated upon Akt 
actlvatlbn In \^vo (31-63). Thus, mTOR Is regulated by the 
PI3K/Akt pathway; however, this does riot appear^ be the 
only mode of regulation of mTOR activity. Whether the P13K 
pathway also reguiatae S6K and 4E-BP1 phosphorylatbn 
Independent of mTOR Is oontrDveraial. 

InterBStlngly, mTOR auloj^Kwphofylatk>n is blocked by wort- 
mannfri but not by rapamycfn ^ This seemfe^ frwonstetency 
suggests that mTOR-rssponshfe regulation of 4E-BP1 and SeK 
activity occumthnoughamechanism other than InWnste^ 
kinase acth%. An altemata pathway tbr4E-BP1 and S6K ph^ 
phorylalton ty mTCW acth% Is by the InhlWtlon of a phospha- 
tasa Trsatmentwtth calyculin A, an Inhibitor of phosphatases 1 
and 2A. reduces rapamydn-lnduced dephosphorytatfon of 4E- 
BP1 and S6K by rapamydn (35). PP2A Interacts with tulHenglh 
S6K but not with a S6K mutant that is resistant to dephospho- 
rylation resulting fnsm rapan^da mTOR phosphorylates PP2A 
^ uto; however, how this process altm PP2A adiWty is not 
knowa Ihese results are consistent vdth the model that phos- 
phorylation of a phosphatase by mTOR prevents dephoepho- 
rylatton of 4E-BP1 and S6K, and conversely, that nutrient dep- 

r^^n and rapaniydn bkiok InhHjItion of the phosphatase fay 
mTOR. 

Potyaderviailon. The po^A) tall In eukaryotic mRMA Is 
Important In enhandng translation Inltlatfon and mRNA sta- 
bIBty. Pdyadenyfatlon plays a k^ rda In regulating gene 
expression during oogenesis and early embryogenesls. 
Some mRNA that are transterttonally Inactive in the oocyte are 
polyadenylated ooncomttantly with IranelatkHiaf activation In 
oocyte matuiBtlon, whereas other mRNAs that are transla- 
Honally active during oogenesis are deadenylated and trans- 
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laUonally sllenoed (36-38). Thus, control of po!y(A) tan syn- 
thesis Is an Important regulatDiy step In gene expressloti. 
The 5' cap and poiy(^ tan are thought to function syneigls- 
tically to regulate nnRNA iranslatkmal effidemy (39, 40). 

RHAPacko^g. Most RNA-blndIng proteins ae assem- 
bled on a transcript at the time of transcriptloh, thus deter- 
mining the translatlonal fate of the transcript (41). A Wghiy 
conserved family of Y-box protdns Is found in cytoplasmic 
messei^ ribonudeoprotein particles, where the proteins 
are thought to play a role In restricting the racniltment of 
mRNA to the tnanslatkHial machinery (41-43). The mator 
mRNA-assodated protein, YB-1 , destabOtees the Interactfon 
of elF4E and the 5' mRNA cap In W&o, and overexfnesslon of 
YB-1 riBsuits In translatlonal reprasston *i vfvo (4fl. Thus, 
alterations In RNA packaging can also play an bnportant role 
In transIatkHial regulation. 

tiranslatton Alterattons Encountered In Cancer 

Three rnah atleratkms at the translatlonal level occur In cancer 
varlatfons In mRNA sequences that Increase or decrease trans^ 
latk)nal effldency, changes In the expresston or avallabDi^ of 
components of the translattonal machinery, and acQvatkm of 
transtatkm through aberrantly activated signd transduction 
palhwi^ Ihe first alteratk)n affects the translatbn of an tndl* 
vWuai mRNA that nrwy play a role In cadnogene^ The seo- 
ond and th W alterations can lead to more global changeSk such 

as an Increase In the overaH rate of protein synthesis, and the 
trenslational activation of several mRNA spedes. 

VartaUonslnmRHASequenGO 
Variations In mRNA sequence affect the transIatkHial effi* 
demy (rfthe transct^Abritf descr^rtton of these variatkms 
and examples of each mechanism follow. 

Mulationa. Mutattons in the mmA sequence, espedaliy 
In the 5' UTR, can alter Its transIatkHial effldency, assaenin 
the ftaUowIng examples. 
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Saftoet-aLpropQised that transit 
fs repressed, whereas bi severelBuridtt tymptKmtas 
that haw dei^lonsof thomlWAS' Ln^t^^ 
is nxie efffetent (4Q. More recently, it was re^^ 
5' trm of o-irot? contains an IF^ ami thus c^n^ 
tlon can be InftEated by a cap-Independent as weO as a 
capKiependent mechanism (40. 47). In petSents wtt^ 
n^eloina, a C-»T mutation In the o-mye IRES was Identlff ed 
(M) aiKifound to cause an enhanced Mflation of translati^ 
via Infernal rfbospmal entry (4^ 

8m4f. Aaomatic point mutatkm (117 Q-*C) m portion 
-3 with respect to the start codon of the emiAt gene was 
Uentifled In a highly aggnasalve sporadic breast cancer ^ 
Chlmartc oonstructe consisting of the wlld4ype or mutated 
BRGAI 5' UTR and adownatrssm fudferase rapoiter dem- 
onstretedadecraaseinthetranslattond] efffe^^ 
UTR mutation. 

CyeSn-iiepwtdent Kln^ Inhnatar 2A. Some inherited 
maianma kindreds have a Q->T transverslon at t>ase -34 
of cycGiMiopendent kinase lnNblto^2A, whfeh encodes a 
OMinKiependent kinase 4/Gyc»n-d^)endent kinase 6 kbiase 
inhibitor Important In % checkpoint regutotion {51). This 
mulatkM\.glves rise to a novel AUG transIatkMi Initlatkm 
codon. creating an upstream open reading frame that com- 
petes for scanr^ ribpsomee and decreases transi^km 
ftom the wlkl-type AUa 

AHamate Splicing and Aitemate Transcription Start 
Sftea. AtoHtfons in splteing and aitemate tranacrtptton sites 
can lead to varftftons in 5' um sequence, ten^ and se^^ 
aiy strucawe^ ulBmately frnpacting translaltonal ^f^^ 

ATM. The ATM gene has four nonooding axons to its 5' 
Ura thrt undergo extensive alternative spBdng The 
contents of 12 different S' LTTRs that show cor^derabto 
dfverBfty In length and sequence have been Mentffled. ITiese 
diwgent 5' leader sequences play an Important role In the 
translatkmal regulatkHi of the ATM gene. 

iiKfta lnaaut)3etoftumors,overaxprGs^oftheorm- 
proteln mdm2 results tn enhanced translatkm of the mdhi2 
mRNA. Use of different promoters teada to two mtfm2 tran- 
scripts that differ only In their 5' leaders (53J, The tonger 5' 
UTR contabis two upstream open reading tame% and this 
mRIMA is loaded vidth riboeomes Intently compared with 
the short 5' urn. 

BOCAI^ In a normal mammary gland, BfK^I mmA Is 
expressed with a shorter leader sequence ^'UTRcO, whereas 
In sporadic braast cancer tissue, BRCA1 mRNA is expressed 
with a longer leader sequence (5' UTRb); the tian8latk)nal 
efficiency of transcrtpta containing 5' UTRb Is lOtlmes lower 
than that of transcripts oonlaining 5' UTRa (54). 

n^P^ 7^^mRNAindudssa1.1-kb5'UTR.whfeh 
exerts an inhlbltoiy effect on translatkm. Many human breast 
cancer Ones contain a novel TQF-p3 trar^cript with a S' 
UTR that is 870 nucleotides shorter and has a7-ib(d greater 
translattonai efficiency than the nonnal 7^/33 mftMA (55). 

Aitemate Pofyadei^ylaflon Sites. Multipto polyadenyi- 
atlon signals leading to the generatton of several transcripts 
with differing 3' UTR'have be&n described for several mRNA 
species, such as the Rarproto^cogene^. ATM gene 
^» tissue Inhibitor pf metalloproteinases^ (57), RHOA 



piotxHNicogene C58). and caimoduUn-l (S9). Although ttie 
effect of these altemale 3' UTRs on translation Is not y&t 
known, they nuy t>e important In RNA-proteIn biteractions 
that affect translatldnid recmitment The role of these alter-^ 
altons in cancer deveh3pment and progression Is unknown. 



TmnsbUon Machhrnry 

AHerattons In the compon^ of tmnsiaUon macMneiy can 
take many fonns. 

Ovoraxpressslon cH elF«s. Overexprassten of elF4B 
causes malignant transfonnatton in rodent ceite (6Q and the 
deregulation of HeU ceU growth (61). Pokmovsf^ ef aL (62) 

found that elF4E overaxpressksn substllutes for serum and 
Individual growth faaim In prasen/lng viabBHy of fibroblasta. 

whteh suggestathatelREcan mediate both proSfeiaihre and 
swvtval signaling. 

Bavated tevels of ell=4E mRNA have been found In a broad 
spectrum of transfbnned cefl lines (BS). alF4E levels ai^ 
elevated bt all ductal cardnoma in shu specimens aruJ inva- 
sive ductal carcinomas, compared vrfUi benign breast spec- 
imens evaluated with Western btot analysis (64, 6^ Pieiini* 

Inary studies suggest that this overeKpresskx) Is atlrftHitable 
to gene ampHfteation (6^. 

There are accumulatfng data suggestb^ that eff% oveioc- 
presskm can bo v&luable as a pragnostte maitor. elF4E over- 
e9(r»e8sion was fbund h a refrosptt^ study to be a rnailcer ^ 
poor prognosis In stages I to III breast candnoma (67). Vari^ 
ton of ti» pro^iostte vakie of elF4E fn breast cancer Is now 
under way In a prospective trial (67). However, In adifferent 
study, elF4E expression was conelated with the aggressive 
behavtor erf non4todgk^'s lymphomas ^ bi a prospective 
ana^ofpaHehte with headend neck cancer, elevated levels 
of elF4E in hlstotogbxdVtumorlneesuii^^cal margte predicts 
a signifteantly inoeased risk of kxakegtonal recunence (0). 
These resuite an suggest that eIRE overaxpresston can be 
used toselectpatfente who mtgmbaieft from more agges^ 
systemb therapy. Rjmiennore, the head and neck cm» 

suggest that eIREoverBxpresslon is afleki defect and can be 
used to guUe k)cal therapy. 

Alterations to Other Initiation Factors. Alterations In a 
numb^' of other Initiation factors have been associated witti 
cancer. Overproductton of eIF4Q, similar to eiF4e» leads to 
malignant transformation In vftro (60). eIF-2a Is found tn 
increased levels in bronchioioaiveolar cardncmaaof thelung 
(3).lnltlattonfectoreiF^1 is overaxpressed In melanoma 
(Tp) and hepatocenular carcinoma (71)- The p40 subunit of 
translation InltiaUon factor 3 is amplified and overaxpressed 
in breast and prostate canQer(72), and the elPSiJi 10 subunit 

is ovarexprassed in testteuiar seminoma (73). Ihe role that 
overexprssslon of these Initiation fectors plays on the devel- 
opment and progresston of cancer, if any, Is not known. 

Ovarexpresdon of 86IC S6K Is amplified and highly 
overaxpressed In the MCF7 breast cancer cell line, com- 
pared with normal mammary epithelium (7^. In a study by 
Barlund etal. (7^, was amplified In 59 of 668 primary 
breast tumors, and a statistically significant association was 
obsenred between amplifteatlon and poor prognosis. 



OveraxpMSslonof PAP. PAP catalyzas 3' poly W fiyn- 
ihesb. PAP Cb werexpressed bi human cancer oeils com- 
pared wWi nomial and vbaUy tiarsfomied cdis (T?). PAP 
enormatfc activity tn breast tumoia to 
PAP protein tevBb (7Q and, in mammaiy tunorcytoaobk was 
fewidlDtjoanrndependentfectorfbrpmdi^ 
Uttle b knom however, about IHW PW 
My affects the transtational profile. 

Aft0ratfon8inRNA-bfaKlIn{|Pff0te!it& Even less is known 
about aftefBtons \n packaging In cancer Sncreaded ex- 
piessbn ami nudaar tocaOzatkm of ttie RN^^^ 
Y&-1 atefaKficatoisorapoorpro^iodsfbrfaic^ 
noo-«naB(»Bli*ig cancer {7^ and ovarian cancer (rS).Ha^ 
ew, this eRM may be medlaiked alt least fri part at the 1^ 
transcription, because YB-1 Increases chemoTBslstance tiy en- 
handng the irenscrfptkan of a multMhio re^stance g 



Aotlwsflfcm of algnd tnGBisduc^ patfiw^ b^^ 
suppressor genes or oveie)q3ies^ of oertafntyro^ 
can contribute to the grovvth and aggressiveness of tbmors. A^ 
In4»rtan^.nnutant In human canceiB is the tumor suppressor 
genePIBV, virtM leads to the acfivatk>n of ^ Pt3K/A^ 
WSV, Activatbn of Pt^ and Akt Induces the onoogente tnara- 
fbrnwtlon of ctftdm embiyD«xoblas^ 
show constitutive phosphorylatton of S6K and of 4&BP1 
A mutant AM that rotate kinase activt^ but does not pho^ 
phoiylate S6K or 4&BP1 does not tiansfonn fibroblasts, 
sii^ests a conrelatkm between the oncog^A^ of PI3K and 
Akt aiKl the phosphorytatkm of S8K and 4E-BP1 

Several tyrosine kinases such as platelet*derived grow»i 
fetctor, bisulIn-Oke growth t^ictor, HOns/heu, and epUennal 
growth lector receptor are ovaraxpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathwe^ known to alter translatton InlUaUon, acUvatkm 
of tnEuislatton is llk^ to contribute to the grovvth and aggres* 
sh/eness of these tumors. Furthermore, the mRNAItor many 
of these kinases themselves are under translational control. 
For example, HER2/neu mRNA is translatkmalV controlled 
both by a short upstream open reading frame that represses 
HERZrneu translatton In a ceO t^lndependent manner and 
by a distinct cell type-dependent mechanism that Increases 
translational efftetency (82K HER2^eu translaflon Is different 
In tmnsfonmed and normsd cells. Thus, It Is possible that 
altmtions at the translatfonal level can in part account for 
the discrepancy between H£A^nei/ gene an^iHflcatkxi de- 
tected by fluorescence In sAu hybrfdteatk)n and prot^ levels 
detected liy Immunohlstochemlcal assays. 



lYanslatfon Targets of Selected Cancer Therapy 
Components of the translation machinery and signal patti- 
ways Involved In the actlvatton of translation kiitlalkm repre^ 
sent good targets for cancer therapy. 

Tarjgeting tf»e niTlOl? Signaling Pathway: Bapamydn 
and TUntstatln 

Rapamydn Inhibits the prbltferatkm of lyinphocytes. It was 
Initially developed as an Immunosuppressh/e dn^ for organ 



transj^antatton. Rapamydn wW) FKBP 12 (FK506*bindIr»g 
protein, Mr 12/)00) to mTDR to Inhibit its ftjnctfon. 

Rapanriydn causes a ainall but alj^lflcant reductkui In the 
Initlatkin rote of protein ^nthesls (89). It bk)ok8oengro^ 
part by bk)Cktng 88 phosphorylaHon and edecUvely 

press&igthetran8raBon<rf5'TDPmRNA8,suchasribos<Mmal* 
protdns. and ek)ngatlon foctors Rapamydn also 

bk)ck8 4E-fiP1 phosphorylatl(m and Inhibits capHdepehdem 
but not cap-Independent translation (17» 
11)0 r^sairycln-aenslthtt dgnal tra^ 

vated during riwllfflttutttranslbrmatkin and cancer progr^storv 

b now bisfrig studied as a tag^ fcr cancer ther8|]y (B^ 

tate, braasS, snriafl oeB lung, gOot^astoma. ni)elanonra» and T<iea 

leutento are among the cancer Bnes most sensitfve to the 

rapamydn anak)gue CCI-779 (VVVetivAyerst Resead^ Reff. 

87). rnrhedxton^yoo6aioornaodllkies.rapanrvdni8em^ 

statteorcyiockM depencfing on the p63elatus ofthecefe pS3 

wlki-^ eels treated wfth raparn^^ arrest In the ^ 

and rnalntain their. vlabSlly, whereas p63 rmitant cells 

late InGi and undergo apoptosis ^ 8$. In a recently raportad 

study (»tng human priniittve neuroeotodermai tumor and 

meduilobfastoma models, rapanntydn eodi!b)ted more cytotox- 

b%hcontolrcrtion with dqplatin and c a i n^ 

dngle agent A9 ito, CCt-779 delayed growth of xenogri^ 

180% afterl weekofthere^jy and 24096 aftor2weeks.A8lr9l9 

hi8h<tose adnMstrdkm caused a 37% decraase h tumor 

vofuma <^owth Mt)ltfc)n 

dsr^sdh b combtnalk)n v^ CCt-77g t^ 

^ Thus, precilnk»l studies suggest tfiat rapantydn ana- 

k)gues are useM as sbigte agents and ki oombinatkm wHh 
cherrxytherapy. 

Rapan^cbi analogues GCI-TTd and RAD001 (f^tovartls, 
Basel, SwHzartancO are now In dintoal trials. Because of ttie 
knovm effect of rapamydn on lym^^xx^yte prdiferatton, a 
potential protriem with rapamydn analogues b Imnninosup* 
presdon. Howevar, eJthoii^ prok>nged invminosupprssstori 
can result from rapamydn and CCI-778 admlnlsterad on 
continuous-dose schedules^ the immunosuppressive efVacte 
of rapamycbi anatogues resolve In ^ h after ftmpy 
<91). The prindpal toddties of CCI-779 have induded der- 
matdogk^ toxicity, myetosupprasdon, Infectton, mucositis, 
dianhea, reversible elevations In Uver function tests, hyper- 
glycemia, hypokalemia, hypocak»mla, and depression 07, 
92-G4). Phase II trials of CCI-779 have been conducted In 
advanced renal cell cardnonrta and in stage Wti breast 
carcinoma patients who Med with prior chemotherapy. In 
the resdts reported In abstract fonn, althou^ there were no 
complete responses, partial responses wae documented in 
fcx)th renal cell cardnoma and In breast carcinoma (94, 
Thus, CCI-779 has documented preliminary didcal aotMty h 
a previously treated, unsdeded patient populatton. 

Active Investigation ia underway Into patient sdectionlbr 
mTOR inhlbitora. Several studies have found an enhanced 
efficacy of Ca-779 In PTEN-null tumors (30, 96). Another 
study found tfiat sbc of dght breast cancer cetl lines were 
rssponslve to CCI-779, dthough only two of these llnea 
lacked PTHN (97) T!>«e was, however, a positive conelaUon 
between Akt acthration and CCI-779 sendttvity (97). This 
eonrelatkm suggests that adivalton of the PI3K-Akt pathway, 



Ttoisbaon tftiBatkw In Oancy 



ragardle8s<tf whefthor K is attributable to a PT^ 
to ovenaxprssslon of receptor tyrosine Wnases, makes can- 
cer cell amenable to mTOR-dlfected thera^. In contrast, 
lower lavelft <rf the target of mTOR. 4E-BP1. are asaoda^ 
with rapam/dn resistance; thio, a lower 4&BP1/elF4E ratio 
may predict rapairiycin resMance 

Anotho- mode of actMty to rapinydn and Its analogues 
appears to bo through Irmibltion of anglogenesis. lliis actV- 
Ity may be both through direct Inhlbltlpn of endbthefial ceH 
proliferation as a lesuh of mTOR Inhibition In these cells or by 
Inhibition of transtatfon of such proanglogenlc factore as 
vascular endothelial growm in turim cells 99, 10^ 

The angiogenesis Inhibitor tunwtallh. an^^ 
dnrgauTBiitl/ undisr study, was also found recently to Inhibit 
translaticm In endcthellai ceils (101). Ihrough a requldto In- 
teraction }M\ inte^ tumstatin inhibits activation of the 
PJSIOAkt pathway and mTOR In endothenal ceils and pre- 
vents dissociation of eil^ljfom 4E-eP1, thei^ biNbithg 
cap-dependent translation. These findln^^ suggest that en- 

dothe&al cells are espectaQy sensitive to therapies targeting 
the mTOR^naling pathway. 



EPA Is an n-3 polyunsaturated ^ add found fai tte fish- 
based diets of populattons havlr^ a low Incidence of cancer 
{log. BPA Inhibits the prolifenation of cancer cells (103), as 
well as In animal models (1 04, 1 05). It blodcs ceU division tiy 
inhibiting translation initiation (iOS). B>A releases G^-** from 
intraceiyar Stores vvhlle Inhibiting their.refining, thweby ac- 
tivating PKR. PKR. in turn phosph orylates and lnhftrftself=2a, 
resulting In the bihibitlon of prote£n synthesis at the level of 
translation Inftfeilon. SMariy, ctotriirotzole, a potent antlprty- 
ilferative agent fr) vftn? and //J vto, Inhibits cen gro 
depletion of Ca?* stores, activation of PKR, and phoapho- 
rylalion of eli^ (10Q. Consequently, clotitoiazrte piBifere^ 
tially decreases the eKprsssion of orcUna A. E, and D1, 
resMRing bi blockage of the cen cycle in 

mda-7 is a novel tunnor suppressor gene being developed 
as a gene llierapy agent Adenoviral transfer of mda-? (Ad- 
mdaT) Induces apoptosis In many cancer cdls Including 
breast, colorectal, and lung cancer (107-109). AdHnda7 also 
Induces and activates PKR, which ieads to phosphorylation 
of elF2a and Induction of apoptosis (HQ). 

Ravonokls such as genista and quercetin suppress tu- 
mor cell gn>wth. AU three mammalten ell=2a Idnases, PKR, 
heme^ulated Inhibitor, and PERK/PEK. are acttvated by 
navonoids, with phosphorylation of elF2a and inhibition of 
protein synthesis (111). 



Taiyeiinge!F4A and elF4E: Andsense AMA 
andP^ddaa 

Anifeenee expression of elF4A decreases toe prDOfsratfon rate 
of nwlanomacells (1 12). Seque^natton of eIF4E by overexpres- 
slon of 4E-BP1 is pnoapoptotlo and decreases tumorlgenk% 
(113, 114). Reduction of elF4E with antisense RNA decreases 
soft ^ growth. Increases tumor latency, and Increases the 
rales of tumor doubling ttnes 0% Antisense eii=54E IWA treat- 



ment also rsduces toe exprss^ of angtogeryc tetors (11 
and haabeen proposed asapotentialacguvant therapy for head 
and nedc canosr^ particularly when elevated eiF^E b found In 
swgicai nriaigfns. ^nan nrtfiecUte 

4EW 43inding domdn of dl=4E ars proapoplotic (1 1 Q met 
ae tiiso being acUvety pursued* 



Exploiting Setocdva Thuiafotfon for Gene Therapy 

A dfffersm therapeutic epproach that tatea advantage of tho 

enhanced cap-dependent iransialion in cancer oeBs is the use 

Of gene therapy vectore enoodng suicide genes with h^^iily 

etnjctured 5' UTR lliese mRNA vrould th^ 

disadvantage binomial cells and not transtele wen, whereas In 

cancer c^ they woUdtrarelalo mors el!fc*ntV-F^e» 

the inlroduclkm of the 5' um of Ibrobiast growth 

the coding sequence of /le^ sdtpfeoridhis f^ 

Miase gene^ aflowa for seleoltve tradafion of 

vhis (ype-7 tfiymUihe Mnase gene 

Gomparad w»h norrnal momiary ceB linea and resulta in se- 
lective aendflidiy to ganciclovir (117). 



Toward the Future 

Ttanslafion is a cmd^ pnxess In every c^L l^owever, aeve^ 

aRerattons in tnui^ational control occur b) caricer. Can^ 

appear to need an aberrantly activated tnmistertional stale ^ 

survfral, tins aUqwIng the targ^ 

surpri^ngly low toxicity. Oortiponents of toe tfanslational ma- 

cWneiy, su^ as eiF46 and signal transduction pattiw^ trv 

volved In transteWon litfllatlon, such mTOa n^Mesent promldng 

targets for cancer therapy. IrMstos of the mTOR have already 

shown s(»ne preliminary activity in dlnlc^ t^ 

that with the development of better predict inarim and 

b^ter patient selecticHi, rosponse rates to singl&iagent therapy 

can be in^proved SMar to other (^tostattc agerirt^ 

mTOR MHbilDis are most fflcely to achltiM» cOnlcai utiRy to 

conibination tfwrapy. In ttie Interinri, otr Irvcrsasbig underetand- 

Ir^ of tanslation initiation and signal trEnsducdon pathws^ 

promise to lead to the identiflcalion of new therapeuliG targets 
In toe near future. 
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We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandera mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
dispIay-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
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(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by microspray liquid chromatography-tandem mass 
spectrometiy (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same met- 
abolic state. The resultant correlation is insufficient for predic- 
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FIG. 1. Schematic illustration of proteome analysis by 2DE and mass spectrometry. In part I, proteins are separated by 2DE, stained spots are excised and subjected 
to in>gel digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performance liquid chromatography. In part II, a peptide is shown 
eluting from the column in part I. The peptide is ionized by electrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass filter allows only the specific mass-to-charge ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of differing lengths (masses). These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadrupole mass filter where two ion 
scries are recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides are identified as being derived from the same protein. 



tion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteome maps with current technol- 
ogy will be very challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND METHODS 

Yeast strain and growth conditions. The source of protein and message tran- 
scripts for all experiments was YPH499 {MATa ura3-S2 fysl-SOl ade2-101 
Ieu2-M his3'/S200 trpl'^63) (30). Ljogarithmically growing cells were obtained by 

growing yeast cells to early log phase (3 x 10*^ cells/ml) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) 
at 3(fC (35). Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Carrels and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80°C. 

2DE. Soluble proteins were run in the first dimension by using a commercial 
flatbed electrophoresis system (Muliiphor II; Pharmacia Biotech). Immobilized 
polyacrylamide gel (IPG) dry strips with nonlinear pH 3.0 to 10.0 gradients 
(Amersham-Pharmacia Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 
to 380 p.1 of solution was added to individual lanes of an IPG strip rehydration 
tray (Ajuersham-Pharmacia Biotech). The strips were allowed to rehydrate at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 kV • 
h), the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wt/vol) SDS-6 M urca-30% (wt/vol) glycerol-0.05 M Tris HCI (pH 6.8) and 
for 4 min in 2.5% iodoacctamidc in 2% (wt/vol) SDS-6 M urca-30% (wt/vol) 
glycerol-0.05 M Tris HCI (pH 6.8), Following rcequilibration, the strips were 
transferred and apposed to 10% polyacrylamide second-dimension gels. Poly- 
aciytamide gels were poured in a casting stand with 10% aery lam idc-2.67% 
piperazinc diaciylamide^.375 M Tris base-HQ (pH 8.8>-0.1% (wt/vol) SDS-0.05% 



1722 GYGI ET AL. 



MoL. Cell. Biol. 



8^0 



100- 



5jO 



4.0 3.0 



O) 



■SS 40- 



o 



6j3 6 ^0 5.8 5 .5 5.2 



82 



60-, 



2* 50-L 



■ -t 15 9 ■ 




62 




134 



53 109^ IQS 




t f i 




30 — W «^ ^ ^ 



25- 
20" 



' « * ♦ *i r ' ' ■ 

^# < a4i8 ' # f 
,99 98 ^ V A..:^ Irt^iA^.... * 




|v4a ^I25^|fe i ^ i| -i^^l ^ f v; ii2/ ^1 



FIG. 2. 2D silver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometiy and database searching. The spots are labeled on the 
gel and correspond to the data presented in Table 1. Molecular weights are given in thousands. 



(wt/vol) ammonium persulfate-0.05% TEMED (Af,A^^'A'-tetramethylethyl- 
enediaminc) in Milli-Q water. The apparatus used to run second-dimension gels 
was a noncommercial apparatus from Oxford Glycoscicnccs, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
50 mA (constant)-500 W for 20 min, followed by 200 mA (constant)-500 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shcvchcnko 
et al. (29). 

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used to excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detection limit by mass spectrometry, spots were cut out and pooled from up to 
four identical cold, silver-stained gels. In-gel tryptic digests of pooled spots were 
performed as described previously (29). Tryptic peptides were analyzed by mi- 
crocapillary LC-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30,2; 250,514 protein sequences) (24a) by using the 
computer program Sequest (8), which matches theoretical and acquired tandem 
mass spectra. A protein match was determined by comparing the number of 
peptides identified and their respective cross-correlation scores. All protein 
identifications were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Velculescu and coworkers have previously generated 
frequency tables for yeast mRNA transcripts from the same strain grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based on two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated from each mRNA transcript in the cell which corresponds to 
15 bp at the 3'-most cutting site for Malll. Second, many transcript lags can be 
concatenated into a single molecule and then sequenced, revealing the identity of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YPH499 growing at mid-log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules per cell (16), this would represent 
a 1.3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of detecting such transcripts. Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, and performed the calculation as described by Velculescu and coworkers 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(i) no CATG site was found in the open reading frame (ORF), (ii) a CATG site 
was found but the corresponding 10-bp putative SAGE tag was not found in the 
frequency tables, or (iii) identical putative SAGE tags were present for multiple 
genes (e.g., TDH2_ YEAST and TDH3_YEAST). 
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TABLE 1. Expressed genes identified from 2D gel in Fig. 2 TABLE I— Continued 
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pl 


Spot no. 


name" 
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hi in Hun ff* 

(10^ copies/ 
cell) 


mRNA 
abundance 
(copies/cell) 


Codon 
bias 


17,259 


6.75 


133 


CPRl 


15.2 


61.7 


0.769 


39,477 


5.58 


86 


FBAl 


17.8 


183.6 


0.935 


18,702 


4.80 


83 


EGD2 


20.1 


5.2 


0.724 


39,477 


5.58 


87 


FBAl 


427.2 


183.6 


0.935 


18,726 


4.44 


147 


YKL056C 


61.2 


88.4 


0.831 


39,540 


6.50 


150 


HOM2 


60.3 


4.5 


0.592 


18,978 


5.95 


135 


YER067W 


3.7 


6.7 


0.118 


39,561 


6.12 


156 


PSAl 


96.4 


27.5 


0.718 


19,108 


5.04 


130 


YLR109W 


94.4 


9.7 


0.680 


41,158 


6.01 


49 


YNL134C 


14.9 


1.5 


0.316 


19,681 


9.08 


136 


ATP7 


11.0 


NA*-^ 


0.246 


41,623 


7.18 


58 


BAT2 


19.0 


8.9 


0.250 


20,505 


6.07 


111 
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16.5 


3.7 


0.422 


41,728 


7.29 


110 
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24.1 


4.5 


0.543 


21,444 


5.25 


148 


SARI 


5.4 


10.4 


0.455 


41,900 


5.42 


74 


TOM40 


22.3 


2.2 


0.375 


21,583 


4.98 


95 


TSAl 


110.6 


40,1 


0.845 


42,402 


6.29 


45 


CYS3 


6.7 


8.9 


0.621 


22,602 


4.30 


80 


EFBl 


66.1 


23.8 


0.875 


42,883 


5.63 


67 


DYSl 


15.8 


5.2 


0.526 


23,079 


6.29 


112 


SOD2 


12.6 


2.2 


0.351 


43,409 


6.31 


107 


SERl 


10.5 


1.5 


0.292 


23,743 


5.44 


137 


HSP26 


NA'' 


0.7 


0.434 


43,421 


5.59 


91 


ERG6 


2.2 


14.1 


0.408 


24,033 


5.97 


96 


ADKl 


17.4 


16.4 


0.656 


44,174 


7.32 


56 


YBR025C 


13.1 


6.0 


0.684 


24,058 


4.43 


143 


YKL117W 


29.2 


10.4 


0.339 


44,682 


4.99 


72 


TIFl 


2.9 


39.4 


0.834 


24,353 


6.30 


140 


TFSl 


8.1 


0.7 


0,146 


44,707 


7.77 


108 


PGKl 


23.7 


165.7 


0.897 


24,662 


5.85 


99 


URA5 


25.4 


6.0 


0.359 


44,707 


7.77 


109 


PGKl 


315.2 


165.7 


0.897 


24,808 


6.33 


97 


GSPl 


26.3 


5.2 


0.735 


46,080 


6.72 


30 


CAR2 


15.4 


NA*^ 


0.495 


24,908 


8.73 


122 


RPS5 


18.6 


NA^ 


0.899 


46,383 


8.52 


53 


IDPl 


7.7 


0.7 


0.436 


25,081 


4.65 


81 


MRP8 


9.3 


NA^ 


0.241 


46,553 


5.98 


47 


IDP2 


32.4 


NA^ 


0.197 


25,960 


6.06 


116 


RPEl 


5.8 


0.7 


0.372 


46,679 


6.39 


50 


ENOl 


35.4 


0.7 


0.930 


26,378 


9.55 


127 
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96.8 


NA^- 


0.863 


46,679 


6.39 


51 


ENOl 


6.6 


0.7 


0.930 


26,467 


5.18 


100 


VMA4 


10.5 


3.7 


0.427 


46,679 


6.39 


52 


ENOl 


2.2 


0.7 


0.930 


26,661 


5.84 


98 
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0.900 


46,773 


5.82 


63 
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15.5 


289.1 


0.960 
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5.56 


93 
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0.7 


0.129 


46,773 


5.82 


64 
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6.13 


115 
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65 
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27,480 


8.95 
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GPMl 


231.4 
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0.902 


47,666 


8.98 


54 
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11.7 


6.0 


0.338 


27,480 


8.95 
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GPMl 


7.5 


169.4 


0.902 


48,364 


5.25 


73 


WTMl 


74.5 


13.4 


0.365 


27,809 


5.97 


139 


HOR2 


5.7 


0.7 


0.381 


48,530 


6.20 


61 


MET17 


38.1 


29.0 


0.576 


27,874 


4.46 


78 


YSTl 


13.6 


52.8 


0.805 


48,904 


5.18 


69 


LYS9 


16.2 


3.7 


0.463 


28,595 


4.51 


41 


PUP2 


4.4 


0.7 


0,147 


48,987 


4.90 


153 


SUP45 


29.6 


11.9 


0.377 


29,156 


6.59 


114 


YMR226C 


14.5 


2.2 


0.283 


49,727 


5.47 


70 


PR02 


13.6 


5.2 


0.297 


29,244 


8.40 


120 


DPMI 


5.0 


11.2 


0.362 


49,912 


9.27 


62 


TEF2 


558.5 


282.0 


0.932 


29,443 


5.91 


48 


PRE4 


3.4 


3.7 


0.162 


50,444 


5.67 


35 


YDR190C 


4.8 


2.2 


0.228 


30,012 


6.39 


138 


PRBl 


21.2 


1.5 


0.449 


50,837 


6.11 


32 


YEL047C 


3.8 


1.5 


0.387 


30,073 


4.63 


77 


BMHl 


14.7 


28.2 


0.454 


50,891 


4.59 


151 


TUB2 


11.2 


7.4 


0.404 


30,296 


7.94 


121 


0MP2 


67.4 


41.6 


0.499 


51,547 


6.80 


27 


LPDl 


18.9 


2.2 


0.351 


30,435 


6.34 


89 


GPPl 


70.2 


11.2 


0.703 


52,216 


7.25 


29 


SHM2 


19.7 


7.4 


0.722 


31,332 


5.57 


88 


ILV6 


13.9 


3.0 


0.402 


52,859 


5.54 


37 
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30.2 


6.7 


0.442 


32,159 


5.46 


113 


IPPl 


63.1 


3.7 


0.752 


53,798 


5.19 


71 
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26.5 
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0.756 


32,263 


6,00 


149 


HISl 


22.4 
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0.232 


53,803 


6.05 


145 


GYP6 
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5.35 


84 
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54,403 
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39 
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37.7 
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34,465 


5.60 


129 
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5.2 


0.305 


54.403 


5.29 


40 
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6.6 


2.2 


0.664 


34,762 


5.32 


85 


SEC14 


10.9 


6.0 


0.373 


54,502 


6.20 


31 


ADE13 


6.3 


1.5 


0.417 


34,797 


5.85 


42 


URAl 


49.5 


8.9 


0.237 


54,543 


7.75 


25 


PYKl 


225.3 


101.8 


0.965 


34,799 


6.04 


90 


BELl 


103.2 


81.0 


0.875 


54,543 


7.75 


26 


PYKl 


39.8 


101.8 


0.965 


35,556 


5.97 


43 


YDL124W 


6.4 


4.5 


0.206 


55,221 


6.66 


146 


YEL071W 


16.3 


3.0 


0.244 


35,619 


8.41 


59 


TDHl 


69.8 


32.r 


0.940 


55,295 


4.35 


134 


PDIl 


66.2 


14.1 


0.589 


35,650 


5.49 


68 


CARl 


5.2 


3.0 


0.339 


55,364 


5.98 


24 


GLKl 


22.6 


6.0 


0.237 


35,712 


6.72 


117 


TDH2 


49.6 


473.0^ 


0.982 


55,481 


7.97 


118 


ATPl 


21.6 


2.2 


0.637 


35,712 


6.72 


154 


TDH2 


863.5 


473.0^ 


0.982 


55,886 


6.47 


28 


CYS4 


22.2 


NA'^ 


0.444 


35,712 


6.72 


155 


TDH2 


79.4 


473.0" 


0.982 


56,167 


5,83 


33 


AR08 


14.3 


3.0 


0.324 


36,272 


4.85 


128 


APAl 


8.7 


0.7 


0.425 


56,167 


5.83 


34 


AR08 


9.1 


3.0 


0.324 


36,358 


5.05 


75 


YJR105W 


17.6 


17.1 


0.522 


56,584 


6.36 


20 


CYB2 


18.9 


NA" 


0.259 


36,358 


5.05 


76 


YJR105W 


27.5 


17.1 


0.522 


57,366 


5.53 


60 


FRS2 


2.3 


0.7 


0.451 


36,596 


6.37 


79 


ADH2 


58.9 


260.0^ 


0.711 


57,383 


5.98 


144 


ZWFl 


5.6 


0.7 


0.215 


36,714 


6.30 


102 


ADHl 


746.1 


260.0 


0.913 


57,464 


5.49 


36 


THR4 


21.4 


3.7 


0.508 


36,714 


6.30 


103 


ADHl 


17.6 


260.0 


0.913 


57,512 


5.50 


7 


SRV2 


6.5 


NA" 


0.260 


36,714 


6.30 


104 


ADHl 


61.4 


260.0 


0.913 


57,727 


4.92 


152 


VMA2 


33.7 


8.9 


0.546 


36,714 


6.30 


105 


ADHl 


52.7 


260.0 


0.913 


58,573 


6.47 


17 


ACHl 


4.4 


1.5 


0.327 


37,033 


6.23 


44 


TALI 


44.8 


3.7 


0.701 


58,573 


6.47 


18 


ACHl 


5.4 


1.5 


0.327 


37,796 


7.36 


57 


IDH2 


29.4 


6.7 


0.330 


61,353 


5.87 


21 


PDCl 


6.5 


200.7 


0.962 


37,886 


6.49 


106 


ILV5 


76,0 


4.5 


0.892 


61,353 


5.87 


22 


PDCl 


303.2 


200.7 


0.962 


38,700 


7.83 


55 


BATl 


30.9 


11.2 


0.469 


61,353 


5.87 


23 


PDCl 


16.3 


200.7 


0.962 


38,702 


6.24 


46 


QCR2 


NA'' 


2.2 


0.326 


61,649 


5.54 


38 


CCT8 


2.2 


1.5 


0.271 
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TABLE "[Continued 



Mol wt 




Spot no. 


irU gene 
name 


Protein 
abundance 

cell) 


mRNA 
abundance 
(copics/ccU) 


Codon 
bias 


61,902 


6.21 


101 


PDC5 


4.3 


NA*' 


0.828 


62,266 


6.19 


16 


TCLl 


20.1 


NA*^ 


0.327 


62,862 


8.02 


19 


ILV3 


5.3 


4.5 


0.548 


63,082 


6.40 


119 


PGM2 


2.2 


3.0 


0.402 


64,335 


5.77 


5 


PABl 


30.4 


1.5 


0.616 


66,120 


5.42 


8 


STIl 


6.7 


0.7 


0.313 


66,120 


5.42 


9 


STIl 


6.4 


0.7 


0.313 


66,450 


5.29 


141 


SSB2 


7.0 


NA'^ 


0.880 


66,450 


5.29 


142 


SSB2 


2.3 


NA^ 


0.880 


66,456 


5.23 


10 


SSBl 


64.5 


79.5 


0.907 


66,456 


5.23 


11 


SSBl 


59.0 


79.5 


0.907 


66,456 


5.23 


12 


SSBl 


13.7 


79.5 


0.907 


68,397 


5.82 


82 


LEU4 


3.1 


3.0 


0.407 


69,313 


4.90 


13 


SSA2 


24.3 


18.6 


0.892 


69,313 


4.90 


14 


SSA2 


77.1 


18.6 


0.892 


74,378 


8.46 


15 


YKL029C 


2.8 


3.7 


0.353 


75,396 


5.82 


6 


GRSl 


5.5 


7.4 


0.500 


85,720 


6.25 


1 


MET6 


2.0 


NA" 


0.772 


85,720 


6.25 


2 


MET6 


10.9 


NA^ 


0.772 


85,720 


6.25 


3 


MET6 


1.4 


NA^ 


0.772 


93,276 


6.11 


131 


EFTl 


17.9 


41.6 


0.890 


93,276 


6.11 


132 


EFTl 


5.7 


41.6 


0.890 


102,064*^ 


6.6r 


94 


ADE3 


4.8 


5.2 


0.423 


107,482' 


5.33*^ 


4 


MCM3 


2.7 


NA^ 


0.240 



" YPD gene names are available from the YPD website (39). 
* NA, calculation could not be performed or was not available. 
mRNA data inconclusive or NA. 

No methionines in predicted ORF; therefore, protein concentration was not 
determined. 

Measured molecular weight or pJ did not match theoretical molecular weight 
or pL 



Protein quantitation. [■'^Slmethionine-labeled gels were exposed to X-ray film 
overnight, and then the silver stain and film were used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
0.6-ml microcentrifuge tubes, and scintillation cocktail (100 |xl) was added. The 
samples were vortexed and counted. In addition, two parallel gels were electro- 
blotted to polyvinylidene difluoride membranes. The membranes were exposed 
to X-ray film, and four intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/methionine was found. This number was used to quantitate 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equilibrium, parallel 2D gels were 
prepared and run on yeast metabolically labeled for 1, 2, 6, or 18 h. TTie 
corresponding 156 spots were excised from each gel, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codon bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein half-lives were calculated 
based on the N-cnd rule (33). When the N-terminal processing was not known 
experimentally, it was predicted based on the affinity of methionine aminopep- 
tidasc (31). 

RESULTS 

Characteristics of proteome approach. Nearly every facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the eluting peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysate (40 pig), 
metabolically labeled with [-^^Slmethionine, was electro- 
phoretically separated by isoelectric focusing in the first dimen- 
sion and by SDS-10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tiyptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pi, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2ArDH3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D gel. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications), and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e., 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
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FIG. 3. Tandem mass (MS/MS) spectra resulting from analysis of a single spot on a 2D gel. The first quadrupolc selected a single mass-to-charge ratio (m/z) of 687.2 
(A) or 592.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by CID was applied. The third 
quadrupole scanned the mass range from 50 to 1,400 miz. The computer program Sequest (8) was utilized to match MS/MS spectra to amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, S57593 (yeast hypothetical protein YMR226C). Five other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
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gene ORFs is presented in Fig. 4A. The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C. No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 

correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels, A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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FIG. 4. Current proteome analysis technology utilizing 2DE without preen- 
richment samples mainly highly expressed and long-lived proteins. Genes encod- 
ing highly expressed proteins generally have large codon bias values. (A) Distri- 
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interval with the largest frequency of genes is 0.0 to 0.1, with more than 2,500 
genes. (B) Distribution of the genes from identified proteins in this study based 
on codon bias. No genes with codon bias values less than 0.1 were detected in this 
study. (C) Distribution of identified proteins in this study based on predicted 
half-life (estimated by N-end rule). 



mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. The techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) is much more time- 
consuming because proteins are analyzed sequentially one by 
one and is not general because analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
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FIG. 5. Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source. mRNA and protein levels were 
calculated as described in Materials and Methods. The data represent a population of genes with protein expression levels visible by silver staining on a 2D gel chosen 
to include the entire range of molecular weights, isoelectric focusing points, and staining intensities. The inset shows the low-end portion of the main figure. It contains 
69% of the original data set. The Pearson product moment correlation for the entire data set was 0.935. The correlation for the inset containing 73 proteins (69%) was 
only 0.356. 



Staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Third, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94, We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include translational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequency tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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FIG. 6. Effect of highly abundant proteins on Pearson product moment correlation coefficient for mRNA and protein abundance in yeast. The set of 106 genes was 
ranked according to protein abundance, and the correlation value was calculated by including the 40 lowest-abundance genes and then progressively including the 
remaining 66 genes in order of abundance. The correlation value climbs as the final 1 1 highly abundant proteins are included. 



than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
ing with [^^SJmethionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 \ig) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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FIG. 7. Relationship between codon bias and protein and mRNA levels in this study. Yeast mRNA and protein expression levels were calculated as described in 
Materials and Methods. The data represent the same 106 genes as in Fig. 5. 



activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
ENOl^YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, EN01_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFTl, TIFl, HXK2, GSPl, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0,1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 



1730 GYGI ET AL. 



MoL. Cell. Biol. 



low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 

ACKNOWLEDGMENTS 

This work was supported by the National Science Foundation Sci- 
ence and Technology Center for Molecular Biotechnology, NTH grant 
T32HG00035-3, and a grant from Oxford Glycosciences. 

We thank Jimmy Eng for expert computer programming, Garry 
Corthals and John R. Yates III for critical discussion, and Siavash 
Mohandesi for expert technical help. 

REFERENCES 

1. Aebersold, R. H., J. Leavitt, R. A. Saavedra, L. E. Hood, and S. B. Kent 
1987. Internal amino acid sequence analysis of proteins separated by one- or 
two-dimensional gel electrophoresis after in situ protease digestion on ni- 
trocellulose. Proc. Natl. Acad. Sci. USA 84:6970-6974. 

2. Aebersold, R. H., D. B. Teplow, L. E. Hood, and S. B. Kent. 1986. Electro- 
blotting onto activated glass. High efficiency preparation of proteins from 
analytical sodium dodecyl sutfate-polyacrylamide gels for direct sequence 
analysis. Eur. J. Biochem. 261:4229-4238. 

3. Bennetzen, J. L., and B. D. Hall. 1982. Codon selection in yeast. J. Biol. 
Chcm. 257:3026-3031. 

4. Boucherie, H., G. Dujardin, M. Kermorgant, C. Monribot, P. Slonimski, and 
M. Perrot. 1995. Two-dimensional protein map of Saccharomyces cerevisiae: 
construction of a gene-protein index. Yeast 11:601-613. 

5. Boucherie, H., F. Sagliocco, R* Joubert, I. Maillet, J. Labarre, and M. Perrot. 
1996. Two-dimensional gel protein database of Saccharomyces cerevisiae. 
Electrophoresis 17:1683-1699. 

6. Carmen, A. A^ P. K. Brindle, C. S. Park, and M. J. Holland. 1995. Tran- 
scriptional regulation by an upstream repression sequence from the yeast 
enolase gene ENOl. Yeast 11:1031-1043. 

7. Ducret, A., 1. VanOostveen, J. K. Eng, J. R. Yates, and R. Aebersold. 1998. 
High throughput protein characterization by automated reverse-phase chro- 
matography/electrospray tandem mass spectrometry. Protein Sci. 7:706-719. 

8. Eng, J., A. McCormack, and J. R. Yates. 1994. An approach to correlate 
tandem mass spectral data of peptides with amino acid sequences in a 
protein database. J, Am. Soc. Mass Spectrom. 5:976-989. 

9. Figeys, D., A. Ducret, J. R, Yates, and R. Aebersold. 1996. Protein identifi- 
cation by solid phase microcxtraction-capillary zone clectrophorcsis-micro- 
electros pray-tandem mass spectrometry. Nat. Biotechnol. 14:1579-1583. 

10. Figeys, D., I. VanOostveen, A. Ducret, and R. Aebersold. 1996. Protein 
identification by capillary zone electrophorcsis/microelectrospray ionizat ion- 
tandem mass spectrometry at the subfcmtomole level. Anal. Chem. 68:1822- 
1828. 

11. Eraser, C. M., and R. D. Fleischmann. 1997. Strategies for whole microbial 
genome sequencing and analysis. Electrophoresis 18:1207-1216. 

12. Garrels, J. I., B. Futcher, R. Kobayashi, G, I. Latter, B. Schwender, T. Voipe, 
J. R. Warner, and C. S. McLaughlin. 1994. Protein identifications for a 
Saccharomyces cerevisiae protein database. Electrophoresis 15:1466-1486. 

13. Garrels, J. I., C. S. McLaughlin, J. R. Warner, B. Futcher, G. L Latter, R. 
Kobayashi, B. Schwender, T. VoIpe, D. S. Anderson, F. Mesquita-Fuentes, 
and W. E. Payne. 1997. Proteome studies of Saccharomyces cerevisiae: iden- 



tification and characterization of abundant proteins. Electrophoresis 18: 
1347-1360. 

14. Gygi, S. P., and R Aebersold. 1998. Absolute quantitation of 2-DE protein 
spots, p. 417-421. In A. J. Link (cd.), 2-D protocols for proteome analysis. 
Humana Press, Totowa, N.J. 

15. Harford, J. B., and D. R. Morris. 1997. Post-transcriptional gene regulation. 
Wiley-Liss, Inc., New York, N.Y. 

16. Hereford, L. M., and M. Rosbash. 1977. Number and distribution of poly- 
adcnylatcd RNA sequences in yeast. Cell 10:453-462. 

17. Hodges, P. £., W. £. Payne, and J. L Garrels. 1998. The Yeast Protein 
Database (YPD): a curated proteome database for Saccharomyces cerevi- 
siae. Nucleic Acids Res. 26:68-72. 

18. Klose, J., and U. Kobalz. 1995. Two-dimensional electrophoresis of proteins: 
an updated protocol and implications for a functional analysis of the ge- 
nome. Electrophoresis 16:1034-1059. 

19. Kurland, C. G. 1991. Codon bias and gene expression. FEES Lett. 285:165- 
169. 

20. Lashkari, D. A., J. L. DeRisi, J. H. McCusker, A. F. Namath, C. Gentile, S. V. 
Hwang, P. O. Brown, and R. W. Davis. 1997, Yeast microarrays for genome 
wide parallel genetic and gene expression analysis. Proc. Natl. Acad. Sci. 
USA 94:13057-13062. 

21. Liang, P., and A. B. Pardee. 1992. Differential display of eukaryotic messen- 
ger RNA by means of the polymerase chain reaction. Science 257:967-971. 

22. Link, A, J., L. G. Hays, E. B. Carmack, and J. R. Yates IIL 1997. Identifying 
the major proteome components of Haemophilus influenzae type-strain 
NCTC 8143. Electrophoresis 18:1314-1334. 

23. Nawrocki, A., M. R. Larsen, A. V. Podtelejnikov, O. N. Jensen, M. Mann, P. 
Roepstorff, A. Gorg, S. J. Fey, and P. M. Larsen. 1998. Correlation of acidic 
and basic carrier ampholyte and immobilized pH gradient two-dimensional 
gel electrophoresis patterns based on mass spcctrometric protein identifica- 
tion. Electrophoresis 19:1024-1035. 

24. O'Farrell, P. H. 1975. High resolution two-dimensional electrophoresis of 
proteins. J. Biol. Chem. 250:4007-4021. 

24a.OWL Protein Sequence Database. 2 August 1998, posting date. [Online.] 
http://bmbsgi11.leeds.ac.uk/bmb5dp/owl.html. [8 January 1999, last date ac- 
cessed.] 

25. Patterson, S. D., and R Aebersold. 1995. Mass spcctrometric approaches for 
the identification of gel-separated proteins. Electrophoresis 16:1791-1814. 

26. Pennington, S. R., M. R. Wilkins, D. F. Hochstrasser, and M.J. Dunn. 1997. 
Proteome analysis: from protein characterization to biological function. 
Trends Cell Biol. 7:168-173. 

27. Shalon, D., S. J. Smith, and P. O. Brown. 1996. A DNA microarray system 
for analyzing complex DNA samples using two-color fluorescent probe hy- 
bridization. Genome Res. 6:639-645. 

28. Sbevchenko, A., O. N. Jensen, A. V. Podtelejnikov, F. Sagliocco, M. Wilm, O. 
Vorm, P. Mortensen, H. Boucherie, and M. Mann. 1996. Linking genome 
and proteome by mass spectrometry: large-scale identification of yeast pro- 
teins from two dimensional gels. Proc. Natl. Acad. Sci. USA 93:14440-14445. 

29. Shevchenko, A., M. Wilm, O. Vorm, and M. Mann. 1996. Mass spectrometric 
sequencing of proteins from silver-stained polyacrylamide gels. Anal. Chem. 
68:850-858. 

30. SIkorski, R. S., and P. Hieter. 1989. A system of shuttle vectors and yeast 
host strains designed for efficient manipulation of DNA in Saccharomyces 
cerevisiae. Genetics 122:19-27. 

31. Tsunasawa, S., J. W, Stewart, and F. Sherman. 1985. Amino-terminal pro- 
cessing of mutant forms of yeast iso-l-cytochrome c. The specificities of 
methionine aminopeptidase and acetyltransferase. J. Biol. Chem. 260:5382- 
5391. 

32. Uriinger, S., K, Kuchler, T. H. Meyer, S. Uebel, and R. Tamp'e. 1997. 
Intracellular location, complex formation, and function of the transporter 
associated with antigen processing in yeast. Eur. J. Biochem. 245:266-272. 

33. Varshavsky, A. 1996. The N-end rule: functions, mysteries, uses. Proc. Natl. 
Acad. Sci. USA 93:12142-12149. 

34. Velculescu, V. E., L. Zhang, B. Vogelstein, and K. W. Kinzler. 1995. Serial 
analysis of gene expression. Science 270:484-487. 

35. Velculescu, V. E., L. Zhang, W. Zhou, J. Vogelstein, M. A. Basrai, D. E. 
Bassett, Jr., P. Hieter, B, Vogelstein, and K. W. Kinzler. 1997. Character- 
ization of the yeast transcnptome. Cell 88:243-251. 

36. Wilkins, M. R., K. L. Williams, R. D. Appel, and D. F. Hochstrasser. 1997. 
Proteome research: new frontiers in functional genomics. Springer- Verlag, 
Berlin, Germany. 

37. Wilm, M., A. Shevchenko, T. Houthaeve, S. Breit, L. Schweigerer, T. Fotsis, 
and M. Mann. 1996. Femtomole sequencing of proteins from polyacrylamide 
gels by nano-electrospray mass spectrometry. Nature 379:466-469. 

38. Yan, J. X., M. R. Wilkins, K. Ou, A. A. Gooley, K. L. Williams, J. C, Sanchez, 
O. Goiaz, C. Pasquali, and D. F. Hochstrasser. 1996. Large-scale amino-acid 
analysis for proteome studies. J. Chromatogr. A 736:291-302. 

39. YPD Website. 6 March 1998, revision date. [Online.] Proteome, Inc. http: 
//www.proteome.com/YPDhome.html. [8 January 1999, last date accessed.] 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SffiES 

□ FADED TEXT OR DRAWEVG 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ <^RAY SCALE DOCUMENTS 



Ul LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




